Hybrid Architecture for Automatic Video-Based Fall Detection Using YOLOv11, MediaPipe Pose, and LSTM Networks
Juan M.Triviño1Email
Andrés F.Lasso1Email
Carlos M.Paredes1✉Email
Victor M.Peñeñory1Email
1LIDIS,Faculty of EngineerinUniversidad de San Buenaventura760035CaliColombia
Abstract
Falls represent one of the leading causes of injury and loss of autonomy among older adults worldwide. This work proposes a lightweight hybrid deep learning architecture for automatic fall detection, combining person detection with YOLOv11m, human pose estimation with MediaPipe, and temporal analysis using a long short-term memory network. Evaluated on the Le2i dataset, the model classified frames into normal activity, fall in progress, and person on the floor, achieving an overall accuracy of 99.23% and a weighted F1-score of 97.38%. The system matches or outperforms recent hybrid and transformer-based approaches while requiring lower computational resources, demonstrating its suitability for real-time embedded or home monitoring applications. Future work will focus on performance in uncontrolled environments and optimization for edge computing.
Keywords
Video-based fall detection
spatio-temporal analysis
YOLOv11
MediaPipe
LSTM
elderly monitoring
Juan M. Triviño , Andrés F. Lasso , Carlos M. Paredes and Victor M. Peñeñory: These authors contributed equally to this work.
A
Introduction
Falls are a major public health concern among older adults, affecting a significant proportion of those over 65 and increasing with age. They lead to health complications, hospitalizations, loss of independence, and reduced quality of life, while placing a substantial burden on healthcare systems Mercy2024.
Given the high prevalence and serious consequences of falls among older adults, there is a need for systems that allow continuous monitoring to help mitigate the risk of severe incidents Kaur2025. Fall detection devices, which can be wearable or non wearable, offer a practical solution. Wearable devices provide high accuracy but may cause discomfort and require proper placement, while non wearable systems reduce user dependence and include vision based and environment based approaches, each with limitations related to lighting, privacy, multiple users, and adaptation to new environments 10.3389/frobt.2020.00071.
Both wearable and non wearable fall detection devices rely on analytical methods to determine whether an event is a fall. Early solutions used simple threshold-based criteria, while machine learning classifiers such as support vector machines, k-nearest neighbors, decision trees, and random forest improved accuracy at the cost of higher computational demands. Recently, deep learning approaches have gained relevance for their robustness and adaptability. Among these, long short-term memory networks capture temporal dependencies, convolutional neural networks enable efficient object and video analysis, and transformer-based architectures integrate spatial and temporal information with strong performance inproceedings, Liu2025, Nez-Marcos2024].
Many existing fall detection approaches rely on complex architectures, multiple sensors, or limited datasets, revealing the need for solutions that balance accuracy, generalization, computational efficiency, and practical applicability. To address this, a hybrid system was developed combining person detection with YOLOv11m, pose estimation with MediaPipe, and temporal analysis using a long short-term memory network. The system was evaluated on the Le2i dataset, with video frames labeled as normal activity, fall in progress, or person on the floor and organized into sequences of 30 frames. The proposed model achieved performance comparable to recent hybrid approaches.
The main contributions of this work are summarized as follows:
The study proposes a lightweight hybrid architecture that combines person detection (YOLOv11m), human pose estimation (MediaPipe), and temporal modeling with LSTM networks, enabling the capture of spatial, postural, and sequential patterns associated with fall events.
A structured preprocessing and labeling strategy is introduced for the Le2i Coffee Room dataset, converting video-level annotations into frame-level labels and generating fixed 30-frame sequences that strengthen temporal learning and improve model robustness.
Experimental evaluation shows that the proposed approach achieves high accuracy and weighted F1-scores while requiring substantially lower computational resources compared to recent transformer-based or hybrid architectures.
The document is organized as follows: In Section2 the methods investigated so farto address the described issue are reviewed, Sect. 3 presents the proposed methodology, Sect. 11 reports the experimental results, Sect. 12 provides the discussion and comparison with state-of-the-art works, and Sect. 13 presents conclusions and future research directions.
Related works
A
Along with deep learning algorithms, a recent trend in studies is the use of multiple datasets for the training process. Among the most notable are UP-Fall, UR-Fall, and LE2i. Table 1 summarizes the most representative methodologies published between 2024 and 2025, detailing the technologies employed, their architectural approaches, and the main results obtained across different datasets, such as UP-Fall, UR-Fall, and Le2i. In general, hybrid methods that integrate convolutional networks with transformers or attention modules achieve accuracy values above 96%, highlighting the potential of deep learning in this area.
A
begin{table}[htpb]\centering\caption{Summary of fall detection methodologies with their technologies and results}\label{tab:fall_detection_methods}\begin{tabular}{p{1cm} p{4.5cm} p{4.2cm} p{2.2cm}}\topruleStudy & Methodology & Technologies & Accuracy (%) \\\midruleLiu2025 & Hybrid model (Modified YOLOv8s + AlphaPose + BCIoU loss) & Object detection, pose estimation, sparse convolutions, BCIoU optimization & +4.3% accuracy / +4.5% F1 / +37.5% speed (Le2i) \\\citep{Nez-Marcos2024} & Deep Learning (Transformer + CNN + GRU with feature fusion) & Video, spatiotemporal extraction, pyramidal network & 95.45% (UR-Fall) / 99.17% (UP-Fall) \\Khan2025 & Deep CNN (RBNet with self-attention + TSA optimization) & Residual networks with self-attention, Tree Seed algorithm & 93.2–92.5% (Soonchunhyang Univ. Dataset) \\Raza2025 & Pose estimation with ML and Vision Transformers & OpenPose, AlphaPose, HRNet, Vision Transformers & 98.90% (Le2i), 96.44% (UP-Fall), 98.43% (UR-Fall) \\Cai2025 & Deep Learning (Video Swin Transformer with hierarchical self-attention) & Vision Transformer, spatiotemporal attention, multiresolution & 96.1% (Le2i) / 97.0% (UR-Fall) / F1=96.4% / Recall=95.8% \\Shin2025 & GCN + Sep-TCN (three spatiotemporal streams) & Skeleton-based learning, body graphs, separable temporal convolution & 99.68% (ImViA) / 99.97% (UP-Fall) / 99.47% (FU-Kinect) / 98.97% (UR-Fall) \\Fu2025 & 3D-SCNN with sparse convolutions for spatiotemporal optimization & Sparse CNN, video analysis, computational load reduction & 99.82% (UR-Fall) / 96.59% (Multi-Camera) \\Kibet2024 & Transformer encoder-decoder on MediaPipe Pose joint sequences & Pose estimation, temporal analysis, Transformer & 97.6% (own dataset) \\Cai2025b & Vision Transformer (ViT) for global fall detection in video & Global self-attention, action recognition, motion analysis & 95.8% (Le2i, UR-Fall) / Sens.=94.6% / F1=95.2% \\Dutt2024 & Modular DL (STFT + 1D-CNN + OpenPose + GradCAM) & Pose estimation, Fourier Transform, interpretability (GradCAM) & 96–98% (UR-Fall, NTU RGB+D, MCFD) \\Ma2025 & Deep Learning (YOLOv11 + STGCN) integrated with Edge Computing & YOLOv11, AlphaPose, STGCN, deployment on Jetson devices (NX and AGX Orin) & Acc., Rec., F1 >0.98 (own dataset); FP 12–16%, FN 15–18% (edge test) \\Li2024 & Deep Learning (Pyramid Network + Transformer + Feature Fusion + GRU) & CNN for image reduction, Transformer with pooling for spatial feature extraction, feature fusion module, GRU for temporal extraction & 99.61% (UR-Fall), 99.33% (Le2i) \\\bottomrule\end{tabular}\end{table}
Based on the review presented in Table 1, it is observed that most recent studies combine CNNs with attention-based or time-series models, such as LSTM or Transformers. This trend reflects a shift toward hybrid models capable of integrating spatial and temporal information more efficiently.
Regarding datasets, the most commonly used remain UR-Fall, UP-Fall, and Le2i, as they are publicly available and facilitate the comparison of results among different studies. Various works [Gaya-Morey2024, Capodici2025 highlight their role as benchmarks for validating new fall detection models. However, they also point out that these datasets have limitations, as they were recorded in controlled environments with few participants and conditions that are not representative of real-world scenarios. For this reason, new and more diverse datasets are being developed, including people of different ages, contexts, and environments, aiming to improve model generalization and performance in real-life situations.
Based on the limitations observed in the available datasets and current methodologies, there is a need to explore new combinations of architectures that efficiently integrate detection, pose estimation, and temporal analysis. The contribution of this work lies in proposing and evaluating a system that combines three main modules: person detection using YOLOv11m, pose estimation with MediaPipe, and temporal analysis using an LSTM network, to determine its effectiveness in the automatic detection of falls in older adults.
Methodology
The general approach is grounded in the sequential integration of three main processes: person detection, pose estimation, and temporal analysis of body movement. From video sequences, the system identifies the individuals present in the scene, extracts the structural information of their joints, and analyzes the temporal patterns associated with postural transitions that characterize a fall event.The following subsections describe in detail the system components, the dataset used, the preprocessing stages, the model architecture, and the training procedure.
Data sources and preprocessing
For the development and evaluation of the model, a selection from the public Le2i Fall Detection Dataset was used, which is widely employed in computer vision–based fall detection research Charfi2013. This dataset contains video recordings in indoor environments such as living rooms, offices, and corridors, capturing both simulated falls and daily activities. The sequences were recorded with RGB and RGB-D cameras and include variations in lighting, camera position, and fall direction, providing diversity and realism to the analyzed scenarios Charfi2013.
In this work, the Coffee Room 1 and Coffee Room 2 subsets of dataset were used, comprising a total of 70 videos recorded in a controlled environment simulating a typical living room. The recordings have a uniform resolution of 320×180 pixels and were captured with RGB cameras under different lighting and perspective conditions.
Each video includes one or more sequences in which participants perform daily activities (such as walking, sitting, or picking up objects) and simulated falls in different directions (frontal, backward, and lateral). Figures 13 show representative examples of the three main scenarios considered in the dataset: a daily activity (1), a fall in progress (2), and a person on the floor after the fall (3).
Fig. 1
Example frame corresponding to a daily activity.
Click here to Correct
Fig. 2
Example frame corresponding to a fall in progress.
Click here to Correct
Fig. 3
Example frame corresponding to a person on the floor after the fall.
Click here to Correct
Each frame in the dataset does not originally contain an associated action label. The dataset only provides temporal markers indicating the start and end of each fall within the videos. Based on this information, a frame-by-frame labeling process was developed to assign a class to each frame. The defined labels were: 0, corresponding to routine or non-fall activities; 1, for frames in which the person is in the process of falling; and 2, for those in which the person is already on the floor.
A
This procedure made it possible to transform the temporal information of the dataset into a fully labeled dataset suitable for training and evaluating the supervised classification model. Table 2 presents an example of the structure of the labels assigned to each frame within a video.
begin{table}[h!]\caption{Example of the tags for each frame within a video.}\label{tab:tabla_ejemplo_anotacion}\centering\begin{tabular}{lcc}\hlineVideo & Frame & Label \\\hlineCoffee_room_01-fall-01 & 1 & 0 \\Coffee_room_01-fall-01 & 2 & 0 \\Coffee_room_01-fall-01 & 3 & 0 \\\vdots & & \\Coffee_room_01-fall-01 &
--1 & 0 \\\hlineCoffee_room_01-fall-01 &
& 1 \\Coffee_room_01-fall-01 &
+1 & 1 \\Coffee_room_01-fall-01 &
+2 & 1 \\\vdots & & \\Coffee_room_01-fall-01 &
& 1 \\\hlineCoffee_room_01-fall-01 &
+1 & 2 \\Coffee_room_01-fall-01 &
+2 & 2 \\Coffee_room_01-fall-01 &
+3 & 2 \\\vdots & & \\Coffee_room_01-fall-01 &
& 2 \\\hline\end{tabular}\end{table}
In this table,
represents the frame number where the fall begins,
corresponds to the frame where the fall ends, and
indicates the total number of frames in the video. In this way, the temporal sequence of each video is segmented into three clearly differentiated phases, allowing the model to progressively learn the transition between normal activities and fall events.
An important limitation of the dataset is that it does not include keypoint or joint annotations, which represent the skeletal position of the person in each frame. These coordinates are essential for training the LSTM network and performing classification based on body postures. Therefore, a complementary pose estimation process was developed using the MediaPipe Pose model to generate the required joint coordinates and enable the training of the proposed model.
During preprocessing, the videos were converted to RGB format and segmented into frames at a rate of 30 frames per second (fps). From these frames, the YOLOv11m person detection model was applied, which allowed delimiting the regions of interest (bounding boxes) corresponding to the individuals present in each scene. The detected regions were then processed using the MediaPipe Pose model, which estimates the coordinates of 33 human body keypoints. Figure 4 shows the different stages of this process.
Fig. 4
Processing flow of each frame.
Click here to Correct
Since MediaPipe internally performs image resizing and normalization before inference, no manual spatial normalization was applied beforehand. The resulting coordinates were stored as two-dimensional vectors
. These coordinates were associated with the corresponding label for each frame, generating a dataset composed of 66 features formed by the 33 pairs of coordinates for each keypoint and the target variable.
It is important to note that both YOLOv11 and MediaPipe Pose, being computer vision models based on deep neural networks, were not able to reliably detect the person and their joint points in all frames. Therefore, frames in which no valid detection was obtained were discarded from the final dataset to maintain the quality and consistency of the sequences used for training.
A
Table 3 presents a simplified example of the resulting dataset structure corresponding to a single video, showing the assigned labels, estimated joint coordinates, and an additional field indicating the validity of each detection.
begin{table}[h!]\caption{Summary structure of the dataset with labels, keypoint coordinates, and the \texttt{keypoints_info} field for each frame.}\label{tab:keypoints}\centering\begin{tabular}{lccc}\hlineFrame & Label &
keypoints \\\hline1 & 0 &
& True \\2 & 0 &
& False \\3 & 0 &
& True \\\vdots & \vdots & \vdots & \vdots \\
--1 & 0 &
& True \\\hline
& 1 &
& True \\\vdots & \vdots & \vdots & \vdots \\
& 1 &
& True \\\hline
+1 & 2 &
& False \\\vdots & \vdots & \vdots & \vdots \\
& 2 &
& False \\\hline\end{tabular}\end{table}
In the first column, the frame number within the video is recorded. The second column indicates the corresponding class label: 0 for daily or non-fall activities, 1 for the falling process, and 2 for the person on the floor. The following 66 columns store the two-dimensional coordinates
of the 33 keypoints identified by the model. Finally, the last column, named Keypoints info, specifies whether the model successfully detected the body structure in the frame (True) or not (False); records with a False value were excluded from the final dataset before training the LSTM model.
Once the data were processed, the frames of each video were grouped into sequences of 30 frames, assigning as the label for each sequence the value of the target variable corresponding to the last frame of the group. This procedure aimed to enable the model to predict the current action based on a history of previous movements.
After grouping, the dataset features adopted a three-dimensional structure
, where
corresponds to the number of sequences,
to the number of frames per sequence, and
to the number of features per frame. The target variable, in turn, has a dimension of
, corresponding to one label per sequence. As a result of this process, 23,393 event sequences were obtained, of which 17,850 correspond to class 0 (normal activity), 1,340 correspond to class 1 (fall in progress), and 4,203 correspond to class 2 (person on the floor). Figure 5 shows the proportion of each class within the dataset.
Fig. 5
Dataset distribution.
Click here to Correct
With the features and target variable properly shaped for an LSTM model, the dataset was divided into three subsets: 70% of the data was allocated for model training, and the remaining 30% was used for validation and testing. Each subset was divided into batches of 64 samples.
Proposed system architecture
The proposed system for automatic fall detection is based on a sequential approach that integrates three main modules: person detection, pose estimation, and temporal analysis. This hybrid architecture allows for the combination of spatial and temporal information, capturing both the localization of individuals and the evolution of their posture over time. The general system flow is illustrated in Figure 6.
Fig. 6
General flow of the proposed system for automatic fall detection.
Click here to Correct
Person detection module (YOLOv11)
In the first stage, the medium variant of the YOLOv11 model is employed for single-pass object detection. This version incorporates optimized convolutions, enhanced attention mechanisms, and a redesigned neck structure, improving generalization, reducing overfitting, and balancing accuracy with inference speed. Its computational efficiency makes it suitable for processing video streams, and recent studies demonstrate its effectiveness in domestic environments Redmon2015, YOLOv11_2025.
Pose estimation module (MediaPipe BlazePose)
The regions of interest detected by YOLOv11 are processed using MediaPipe Pose, based on the BlazePose model Bazarevsky2020, which estimates three-dimensional coordinates of 33 body keypoints from RGB images. Its lightweight design allows real-time inference on embedded devices Zhang2023, producing a robust skeletal representation that is resilient to occlusions and lighting changes. Keypoint-based representations have been shown to improve discrimination between normal activities and falls by capturing body kinematics Raza2025, Shin2025, Kibet2024.
Temporal analysis module (LSTM)
The set of obtained joint coordinates is organized as a temporal sequence and fed into a recurrent neural network of the Long Short-Term Memory (LSTM) type. LSTMs are an extension of conventional recurrent neural networks (RNNs) that use input, forget, and output gates to regulate the flow of information and mitigate the vanishing gradient problem. This ability allows them to model long-range temporal dependencies, capturing abrupt posture transitions that may indicate the occurrence of a fall.
The proposed model comprises three sequential LSTM layers with 128 units each, followed by a dense layer of 128 neurons and an output layer with three units using softmax activation. The softmax function converts the output into a probability distribution over the classes, enabling multiclass classification. For an output vector
with
classes, the softmax activation function is defined as:
Where
is the exponential of the value
, and
is the sum of all exponentials, ensuring that the resulting probabilities are non-negative and sum to one Goodfellow2016. The architecture is illustrated in Figure 7.
usetikzlibrary{positioning}
Fig. 7
Internal architecture of the LSTM module for temporal classification of joint sequences.
Training procedure
The training was executed using the sparse categorical cross-entropy loss function, a suitable option for multiclass classification with integer labels. The objective of this function is to calculate the discrepancy between the probability distribution predicted by the model and the actual class. The objective is to minimize this discrepancy, with predictions of low probability for the correct class being subject to more severe penalization.
For a set of
samples, the sparse categorical cross-entropy is defined as:
Where
is the probability predicted by the model for the true class
of sample
,
corresponds to the integer representing the correct class, and
is the total number of samples Goodfellow2016.
The model was trained using the Adam optimizer for 50 epochs with a batch size of 64 sequences, splitting the dataset into 70 percent for training and 30 percent for validation and testing. Weights were updated via backpropagation through time, with an initial learning rate of
and early stopping to prevent overfitting. Training was implemented in Python using PyTorch, MediaPipe, and YOLOv11 on an NVIDIA RTX 5060 GPU and an AMD Ryzen 7 CPU. Loss metrics were monitored during each epoch to track learning and detect overfitting.
Model evaluation
The model was evaluated using standard metrics for multiclass classification, including precision, recall, and the F1-score, to assess both overall performance and the ability to distinguish between classes. Additionally, the confusion matrix was analyzed to identify correct and incorrect predictions for each category, highlighting potential misclassifications between normal activities, falls in progress, and individuals on the floor.
Results
A
The proposed model was evaluated using the test set, achieving an overall accuracy of 99.23%. The precision, recall, and F1-score values obtained for each class are presented in Table 4, showing consistent performance across all three categories. Class 0 (no fall) achieved a precision of 99.67% and a recall of 99.93%; class 1 (partial fall) achieved a precision of 93.87% and a recall of 93.43%; while class 2 (complete fall) reached a precision of 99.17% and a recall of 98.20%. The weighted average of the metrics reflects balanced performance, with an overall F1-score of 97.38%.
begin{table}[htpb]\centering\caption{Evaluation metrics by class.}\label{tab:metricas}\begin{tabular}{lcccc}\hlineClass & Precision & Recall & F1-score & Support \\\hline0 (No fall) & 0.9967 & 0.9993 & 0.9980 & 2685 \\1 (Partial fall) & 0.9387 & 0.9343 & 0.9365 & 213 \\2 (Complete fall) & 0.9917 & 0.9820 & 0.9869 & 612 \\\hlineWeighted avg. & 0.9923 & 0.9923 & 0.9923 & 3510 \\Macro avg. & 0.9757 & 0.9719 & 0.9738 & 3510 \\\hline\end{tabular}\end{table}
The evaluation results indicate that the proposed model achieves a very high level of overall correctness, suggesting strong reliability in distinguishing between the different movement categories. The performance for normal activities stands out as the most consistent, showing that the system can accurately recognize situations where no fall occurs and avoid generating unnecessary alerts. This characteristic is essential for real-world use, as excessive false alarms can reduce trust in the system.
The model also performs very well when identifying complete falls. This demonstrates its ability to detect critical events with very few mistakes, which is particularly important in safety-related applications where failing to recognize an actual fall could lead to serious risks.
Partial falls are more challenging due to subtle and ambiguous movements, but the model maintains reliable performance across all categories. Overall, the results demonstrate balanced behavior and support the suitability of the system for practical deployment in distinguishing normal activities and detecting falls.
The confusion matrix (Figure 8) shows that classification errors are minimal, with slight confusion between classes 1 and 2.
Fig. 8
Confusion matrix of the model.
Click here to Correct
The loss curve (Figure 9) shows rapid convergence during the first 10 epochs and subsequent stability, indicating a training process without overfitting.
Fig. 9
Loss evolution during training and validation.
Click here to Correct
Discussion
The results obtained confirm the strong performance of the proposed model for automatic fall detection. The overall accuracy and the weighted performance score indicate a high capacity for generalization and an effective balance between correctly identifying fall events and minimizing false alarms. This balance is crucial in monitoring environments, where excessive false alerts can undermine user confidence and hinder system adoption.
The analysis of the confusion matrix reveals that classification errors are mainly concentrated between classes 1 and 2, which can be attributed to the visual similarity between a partial and a complete fall. This phenomenon aligns with previous observations in the literature and reflects the difficulty of temporally delimiting transitions between postural states.
When compared with recent state-of-the-art studies, the obtained results are comparable and demonstrate competitive performance. Hybrid models based on convolutional and transformer networks, such as the one proposed by Nez-Marcos et al. \citep{Nez-Marcos2024}, achieved accuracies above 95% by integrating spatial and temporal information. Similarly, Liu et al. Liu2025 combined YOLOv8 and AlphaPose to improve precision and F1-score, while Cai et al. Cai2025 reported F1 values above 96% with Video Swin Transformer models.
The study presents some limitations, including a controlled dataset, evaluation on short sequences, and the omission of practical factors such as energy consumption and processing time, which may affect generalization and real-time deployment.
Despite its limitations, the proposed system combining YOLOv11m, MediaPipe, and an LSTM network achieves performance comparable to state-of-the-art models while using a lighter architecture and lower computational resources. This efficiency makes it suitable for embedded devices and home monitoring, and the results validate the effectiveness of integrating detection, pose estimation, and temporal analysis for scalable real-time applications.
Conclusions
This work proposed and evaluated an automatic fall detection system that integrates three complementary modules: person detection using YOLOv11m, pose estimation with MediaPipe, and temporal analysis through an LSTM network. The obtained results demonstrated strong performance, achieving an overall accuracy of 99.23\,% and a weighted F1-score of 97.38%, evidencing the effectiveness of the proposed architecture in classifying sequences associated with falls.
Together, the three modules form a hybrid architecture that integrates spatial detection (YOLOv11), structural body description (MediaPipe BlazePose), and temporal modeling (LSTM). This sequential flow efficiently captures the body dynamics associated with falls, ensuring an interpretable representation that is potentially adaptable to edge computing and smart home systems.
The comparative analysis with recent studies demonstrates that the proposed model achieves competitive performance within current trends in computer vision and spatiotemporal analysis. Its combination of efficient and lightweight modules makes it suitable for domestic environments and teleassistance systems, where processing capacity and energy consumption are critical. Despite limitations related to the controlled nature of the dataset, the results validate the proposed approach. Future work will focus on evaluation in more diverse environments, the integration of temporal attention mechanisms or lightweight transformer-based modules, and testing the system on embedded platforms for real-time operation.
Data Availability Statement
The dataset used in this study (Le2i Fall Detection Dataset – Coffee Room subsets) is publicly available and can be accessed through the original repository provided by the authors. No additional data were generated or required for this work.
Funding
This research was supported for grant project number 34416097, Industria 4.0: Integración de tecnologías impulsoras de la industria 4.0.
Competing interests
The authors declare that they have no competing interests.
Author contributions statement
Juan M. Triviño: Writing – original draft, Visualization, Validation, Software, Methodology, Investigation, Formal analysis, Resources, Data curation, Conceptualization. Andrés F. Lasso: Writing – original draft, Visualization, Validation, Software, Methodology, Investigation, Formal analysis, Data curation, Conceptualization. Carlos M. Paredes: Conceptualization, Formal analysis, Investigation, Writing – review & editing, Supervision, Project administration, Funding acquisition. Victor M. Peñeñory: Supervision, Project administration, Funding acquisition.
Editorial Policies for:
Springer journals and proceedings:
Nature Portfolio journals:
:
BMC journals: \bibliography{reference}
A
Author Contribution
J.M.T.: Writing – original draft, Visualization, Validation, Software, Methodology, Investigation, Formal analysis, Resources, Data curation, Conceptualization. A. F. L.: Writing – original draft, Visualization, Validation, Software, Methodology, Investigation, Formal analysis, Data curation, Conceptualization. C. M. P.: Conceptualization, Formal analysis, Investigation, Writing – review \& editing, Supervision, Project administration, Funding acquisition. V. M. P.: Supervision, Project administration, Funding acquisition.
References:
Ian Goodfellow and Yoshua Bengio and Aaron Courville (2016) Deep Learning. MIT Press, Cambridge, MA, https://www.deeplearningbook.org/
Joseph Redmon and Santosh Kumar Divvala and Ross B. Girshick and Ali Farhadi (2015) You Only Look Once: Unified, Real-Time Object Detection. CoRR abs/1506.02640dblp computer science bibliography, https://dblp.org, https://dblp.org/rec/journals/corr/RedmonDGF15.bib, Mon, 13 Aug 2018 16:48:08 +0200, 1506.02640, arXiv, http://arxiv.org/abs/1506.02640
Jocher, Glenn. YOLOv11: Next-Generation Real-Time Object Detection. https://github.com/ultralytics/YOLOv11, 2025
Bazarevsky, Valentin and Zhang, Fan and Vakunov, Andrey and Sung, Gabriel and Grundmann, Matthias (2020) BlazePose: On-device Real-time Body Pose Tracking. CVPR Workshop
Zhang, Wei and Chen, Yu (2023) Real-Time Human Pose Estimation Using Lightweight CNN Architectures. IEEE Access
Hochreiter, Sepp and Schmidhuber, J ürgen (1997) Long Short-Term Memory. Neural Computation 9(8): 1735--1780
Imen Charfi and Johel Miteran and Julien Dubois and Mohamed Atri and Rached Tourki (2013) {Optimized spatio-temporal descriptors for real-time fall detection: comparison of support vector machine and Adaboost-based classification}. Journal of Electronic Imaging 22(4): 041106 https://doi.org/10.1117/1.JEI.22.4.041106, https://doi.org/10.1117/1.JEI.22.4.041106, Video, Cameras, Education and training, Feature extraction, Object detection, Video surveillance, Image classification, Feature selection, Tunable filters, Actinium, SPIE
Bogdan Kwolek and Michal Kepski. Human fall detection on embedded platform using depth maps and wireless accelerometer. Assistive technology,Depth image analysis,Fall detection,Sensor technology for smart homes, Since falls are a major public health problem in an ageing society, there is considerable demand for low-cost fall detection systems. One of the main reasons for non-acceptance of the currently available solutions by seniors is that the fall detectors using only inertial sensors generate too much false alarms. This means that some daily activities are erroneously signaled as fall, which in turn leads to frustration of the users. In this paper we present how to design and implement a low-cost system for reliable fall detection with very low false alarm ratio. The detection of the fall is done on the basis of accelerometric data and depth maps. A tri-axial accelerometer is used to indicate the potential fall as well as to indicate whether the person is in motion. If the measured acceleration is higher than an assumed threshold value, the algorithm extracts the person, calculates the features and then executes the SVM-based classifier to authenticate the fall alarm. It is a 365/7/24 embedded system permitting unobtrusive fall detection as well as preserving privacy of the user.
Manning, Christopher D. and Raghavan, Prabhakar and Sch ütze, Hinrich (2008) Introduction to Information Retrieval. Cambridge University Press, Cambridge, 978-0-521-86571-5
Wang, Xueyi and Ellul, Joshua and Azzopardi, George (2020) Elderly Fall Detection Systems: A Literature Survey. Frontiers in Robotics and AI Volume 7 - 2020 https://doi.org/10.3389/frobt.2020.00071, <p >Falling is among the most damaging event elderly people may experience. With the ever-growing aging population, there is an urgent need for the development of fall detection systems. Thanks to the rapid development of sensor networks and the Internet of Things (IoT), human-computer interaction using sensor fusion has been regarded as an effective method to address the problem of fall detection. In this paper, we provide a literature survey of work conducted on elderly fall detection using sensor networks and IoT. Although there are various existing studies which focus on the fall detection with individual sensors, such as wearable ones and depth cameras, the performance of these systems are still not satisfying as they suffer mostly from high false alarms. Literature shows that fusing the signals of different sensors could result in higher accuracy and lower false alarms, while improving the robustness of such systems. We approach this survey from different perspectives, including data collection, data transmission, sensor fusion, data analysis, security, and privacy. We also review the benchmark data sets available that have been used to quantify the performance of the proposed methods. The survey is meant to provide researchers in the field of elderly fall detection using sensor networks with a summary of progress achieved up to date and to identify areas where further effort would be beneficial. </p >, 2296-9144, https://www.frontiersin.org/journals/robotics-and-ai/articles/10.3389/frobt.2020.00071
Xi Cai and Xiangcheng Wang and Yin Jiao and Yinuo Chen and Guang Han (2025) Multi-scale motion perception fall detection algorithm based on video swin transformer. Signal, Image and Video Processing 19 https://doi.org/10.1007/s11760-025-04358-3, Springer Science and Business Media Deutschland GmbH, 10, Computer vision,Deep learning,Fall detection,Video swin transformer, 10, 18631711, Fall detection is a prominent subject in healthcare. Advancements in modern monitoring and deep learning have sparked significant social interest in visual fall detection. Despite the success of various deep learning methods in video fall detection owing to their superior feature extraction capabilities, they still encounter challenges in analyzing long-range or short-range spatiotemporal correlations. Taking this into account, a multi-scale motion perception fall detection algorithm based on video swin transformer is proposed in this study. Our proposed method employs video swin transformer as the backbone to fully model the global and local spatiotemporal information from videos and optimizes the backbone with two integrated modules. On one hand, we design a multi-scale motion information aggregation module to overcome the difficulty of the model in focusing on key multi-scale motion features. On the other hand, we propose a token pruning module to reduce the computational cost by pruning redundant temporal tokens. Experimental results demonstrate that the proposed algorithm exhibits promising outcomes, with an accuracy of 96.11% and 97.05% on the Le2i and UR fall detection datasets, respectively, thus outperforming some existing advanced algorithms.
Jungpil Shin and Abu Saleh Musa Miah and Rei Egawa and Koki Hirooka and Md Al Mehedi Hasan and Yoichi Tomioka and Yong Seok Hwang (2025) Fall recognition using a three stream spatio temporal GCN model with adaptive feature aggregation. Scientific Reports 15 https://doi.org/10.1038/s41598-025-95508-7, Nature Research, 40148548, 12, Ageing people,AlphaPose,Classification,Fall detection (FD),Graph convolutional network (GCN),Human activity recognition (HAR),Multi-stream deep learning,Sep-TCN,Spatio-temporal, 1, 20452322, The prevention of falls is paramount in modern healthcare, particularly for the elderly, as falls can lead to severe injuries or even fatalities. Additionally, the growing incidence of falls among the elderly, coupled with the urgent need to prevent suicide attempts resulting from medication overdose, underscores the critical importance of accurate and efficient methods of detecting a fall. This makes a computer-aided fall detection system necessary to save elderly people ’s lives worldwide. Many researchers have been working to develop fall detection systems. However, the existing systems often struggle with problems such as unsatisfactory accuracy, limited robustness, high computational complexity, and sensitivity to environmental factors. In response to these challenges, this paper proposes a novel three-stream spatio-temporal feature-based human fall detection system. Our system incorporates joint skeleton-based spatial and temporal Graph Convolutional Network (GCN) features, joint motion-based spatial and temporal GCN features, and residual connections-based features. Each stream employs adaptive graph-based feature aggregation and consecutive separable convolutional neural networks (Sep-TCN), significantly reducing the computational complexity and the number of parameters of the model compared to prior systems. Experimental results on multiple datasets demonstrate the superior effectiveness and efficiency of our proposed system, with accuracies of 99.68%, 99.97%, 99.47 % and 98.97% achieved on the ImViA, Fall-UP, FU-Kinect and UR-Fall datasets, respectively. The remarkable performance of our system highlights its superiority, efficiency, and generalizability in real-world human fall detection scenarios, offering significant advancements in healthcare and societal well-being.
Fangping Fu (2025) A VIDEO-BASED FALL DETECTION USING 3D SPARSE CONVOLUTIONAL NEURAL NETWORK IN ELDERLY CARE SERVICES. Machine Graphics and Vision 34: 53-74 https://doi.org/10.22630/MGV.2025.34.1.3, Institute of Information Technology, Warsaw University of Life Sciences - SGGW, 3D convolutional neural network,fall detection,jitter buffer,sparse convolution, 1, 2720250X, Falls in the elderly have become one of the major risks for the growing elderly population. Therefore, the application of automatic fall detection system for the elderly is particularly important. In recent years, a large number of deep learning methods (such as CNN) have been applied to such research. This paper proposed a sparse convolution method 3D Sparse Convolutions and the corresponding 3D Sparse Convolutional Neural Network (3D-SCNN), which can achieve faster convolution at the approximate accuracy, thereby reducing computational complexity while maintaining high accuracy in video analysis and fall detection task. Additionally, the preprocessing stage involves a dynamic key frame selection method, using the jitter buffers to adjust frame selection based on current network conditions and buffer state. To ensure feature continuity, overlapping cubes of selected frames are intentionally employed, with dynamic resizing to adapt to network dynamics and buffer states. Experiments are conducted on Multi-camera fall dataset and UR fall dataset, and the results show that its accuracy exceeds the three compared methods, and outperforms the traditional 3D-CNN methods in both accuracy and losses.
Xi Cai and Xiangcheng Wang and Kexin Bao and Yinuo Chen and Yin Jiao and Guang Han (2025) Visual perception enhancement fall detection algorithm based on vision transformer. Signal, Image and Video Processing 19 https://doi.org/10.1007/s11760-024-03652-w, Springer Science and Business Media Deutschland GmbH, 1, Attention mechanism,Computer vision,Deep learning,Fall detection,Vision transformer, 1, 18631711, Fall detection is a crucial research topic in public healthcare. With advances in intelligent surveillance and deep learning, vision-based fall detection has gained significant attention. While numerous deep learning algorithms prevail in video fall detection due to excellent feature processing capabilities, they all exhibit limitations in handling long-term spatiotemporal dependencies. Recently, Vision Transformer has shown considerable potential in integrating global information and understanding long-term spatiotemporal dependencies, thus providing novel solutions. In view of this, we propose a visual perception enhancement fall detection algorithm based on Vision Transformer. We utilize Vision Transformer-Base as the baseline model for analyzing global motion information in videos. On this basis, to address the model ’s difficulty in capturing subtle motion changes across video frames, we design an inter-frame motion information enhancement module. Concurrently, we propose a locality perception enhancement self-attention mechanism to overcome the model ’s weak focus on local key features within the frame. Experimental results show that our method achieves notable performance on the Le2i and UR datasets, surpassing several advanced methods.
Duncan Kibet and Min Seop So and Hahyeon Kang and Yongsu Han and Jong Ho Shin (2024) Sudden Fall Detection of Human Body Using Transformer Model. Sensors 24 https://doi.org/10.3390/s24248051, Multidisciplinary Digital Publishing Institute (MDPI), 39771788, 12, fall detection,pose estimation,speed-based anomaly detection,time-series analysis,transformers, 24, 14248220, In human activity recognition, accurate and timely fall detection is essential in healthcare, particularly for monitoring the elderly, where quick responses can prevent severe consequences. This study presents a new fall detection model built on a transformer architecture, which focuses on the movement speeds of key body points tracked using the MediaPipe library. By continuously monitoring these key points in video data, the model calculates real-time speed changes that signal potential falls. The transformer ’s attention mechanism enables it to catch even slight shifts in movement, achieving an accuracy of 97.6% while significantly reducing false alarms compared to traditional methods. This approach has practical applications in settings like elderly care facilities and home monitoring systems, where reliable fall detection can support faster intervention. By homing in on the dynamics of movement, this model improves both accuracy and reliability, making it suitable for various real-world situations. Overall, it offers a promising solution for enhancing safety and care for vulnerable populations in diverse environments.
Lei Liu and Yeguo Sun and Yinyin Li and Yihong Liu (2025) A hybrid human fall detection method based on modified YOLOv8s and AlphaPose. Scientific Reports 15 https://doi.org/10.1038/s41598-025-86429-6, Nature Research, 39837978, 12, Computer vision,Fall detection,Human pose estimation,Object detection, 1, 20452322, To address the challenges of low detection accuracy of small objects and weak applicability of the multi-person fall action recognition applications, we propose a hybrid fall detection method based on modified YOLOv8s and AlphaPose called HFDMIA-Pose. Firstly, we use the modified Yolov8s as object detector. It uses SPD-Conv to preserve small object features and adds a small object detection layer, while using BCIOU as the loss function. These methods can effectively improve the accuracy of small object detection and significantly reduce the model size. Secondly, we improve the fall recognition accuracy by introducing a hybrid fall detection algorithm based on human skeletal nodes. Lastly, we build a multi-person fall detection dataset (MPFDD) to test the model ’s effectiveness in multi-person scenarios. Validated on datasets Le2i and MPFDD, our method improves accuracy by 4.30%, F1 by 4.57%, and FPS by 37.50% faster than the AlphaPose. Compared with other models, our model improves accuracy by 5.33% on average, F1 by 5.51%, and FPS by 43.05% faster on average. Therefore, HFDMIA-Pose has significantly improved performance compared to the original model and it also demonstrates strong competitiveness over other advanced human fall detection models. Furthermore, it has the advantages of high detection accuracy, fewer model size, and fast speed, which makes it more suitable for resource constrained edge environments and can meet industrial and daily scenarios.
F. Xavier Gaya-Morey and Cristina Manresa-Yee and Jos é M. Buades-Rubio (2024) Deep learning for computer vision based activity recognition and fall detection of the elderly: a systematic review. Applied Intelligence 54: 8982-9007 https://doi.org/10.1007/s10489-024-05645-1, Springer, 10, Ambient assisted living,Computer vision,Deep learning,Elderly,Fall detection,Human activity recognition, 19, 15737497, Abstract: As the proportion of elderly individuals in developed countries continues to rise globally, addressing their healthcare needs, particularly in preserving their autonomy, is of paramount concern. A growing body of research focuses on Ambient Assisted Living (AAL) systems, aimed at alleviating concerns related to the independent living of the elderly. This systematic review examines the literature pertaining to fall detection and Human Activity Recognition (HAR) for the elderly, two critical tasks for ensuring their safety when living alone. Specifically, this review emphasizes the utilization of Deep Learning (DL) approaches on computer vision data, reflecting current trends in the field. A comprehensive search yielded 2,616 works from five distinct sources, spanning the years 2019 to 2023 (inclusive). From this pool, 151 relevant works were selected for detailed analysis. The review scrutinizes the employed DL models, datasets, and hardware configurations, with particular emphasis on aspects such as privacy preservation and real-world deployment. The main contribution of this study lies in the synthesis of recent advancements in DL-based fall detection and HAR for the elderly, providing insights into the state-of-the-art techniques and identifying areas for further improvement. Given the increasing importance of AAL systems in enhancing the quality of life for the elderly, this review serves as a valuable resource for researchers, practitioners, and policymakers involved in developing and implementing such technologies. Graphical abstract: (Figure presented.).
Angelo Capodici and Claudio Fanconi and Catherine Curtin and Alessandro Shapiro and Francesca Noci and Alberto Giannoni and Tina Hernandez-Boussard (2025) A scoping review of machine learning models to predict risk of falls in elders, without using sensor data. Diagnostic and Prognostic Research 9 https://doi.org/10.1186/s41512-025-00190-y, Springer Science and Business Media LLC, 5, 1, This scoping review assesses machine learning (ML) tools that predicted falls, relying on information in health records without using any sensor data. The aim was to assess the available evidence on innovative techniques to improve fall prevention management. Studies were included if they focused on predicting fall risk with machine learning in elderly populations and were written in English. There were 13 different extracted variables, including population characteristics (community dwelling, inpatients, age range, main pathology, ethnicity/race). Furthermore, the number of variables used in the final models, as well as their type, was extracted. A total of 6331 studies were retrieved, and 19 articles met criteria for data extraction. Metric performances reported by authors were commonly high in terms of accuracy (e.g., greater than 0.70). The most represented features included cardiovascular status and mobility assessments. Common gaps identified included a lack of transparent reporting and insufficient fairness assessments. This review provides evidence that falls can be predicted using ML without using sensors if the amount of data and its quality is adequate. However, further studies are needed to validate these models in diverse groups and populations.
Jianjun Yan and Xueqiang Wang and Jiangtao Shi and Shuai Hu (2023) Skeleton-Based Fall Detection with Multiple Inertial Sensors Using Spatial-Temporal Graph Convolutional Networks. Sensors 23 https://doi.org/10.3390/s23042153, MDPI, 36850753, 2, fall detection,multiple inertial sensors,skeleton,spatial-temporal graph convolutional networks, 4, 14248220, The application of wearable devices for fall detection has been the focus of much research over the past few years. One of the most common problems in established fall detection systems is the large number of false positives in the recognition schemes. In this paper, to make full use of the dependence between human joints and improve the accuracy and reliability of fall detection, a fall-recognition method based on the skeleton and spatial-temporal graph convolutional networks (ST-GCN) was proposed, using the human motion data of body joints acquired by inertial measurement units (IMUs). Firstly, the motion data of five inertial sensors were extracted from the UP-Fall dataset and a human skeleton model for fall detection was established through the natural connection relationship of body joints; after that, the ST-GCN-based fall-detection model was established to extract the motion features of human falls and the activities of daily living (ADLs) at the spatial and temporal scales for fall detection; then, the influence of two hyperparameters and window size on the algorithm performance was discussed; finally, the recognition results of ST-GCN were also compared with those of MLP, CNN, RNN, LSTM, TCN, TST, and MiniRocket. The experimental results showed that the ST-GCN fall-detection model outperformed the other seven algorithms in terms of accuracy, precision, recall, and F1-score. This study provides a new method for IMU-based fall detection, which has the reference significance for improving the accuracy and robustness of fall detection.
Mohammadamin Salimi and Jos é J.M. Machado and Jo ão Manuel R.S. Tavares (2022) Using Deep Neural Networks for Human Fall Detection Based on Pose Estimation. Sensors 22 https://doi.org/10.3390/s22124544, MDPI, 35746325, 6, computer vision,deep learning,image analysis,machine learning, 12, 14248220
Jinxi Zhang and Zhen Li and Yu Liu and Jian Li and Hualong Qiu and Mohan Li and Guohui Hou and Zhixiong Zhou (2024) An Effective Deep Learning Framework for Fall Detection: Model Development and Study Design. Journal of Medical Internet Research 26 https://doi.org/10.2196/56750, JMIR Publications Inc., 39102676, MobiFall,Sisfall,accelerometer,deep learning,fall detection,gyroscope,human health,self-attention,wearable sensors, 14388871, Background: Fall detection is of great significance in safeguarding human health. By monitoring the motion data, a fall detection system (FDS) can detect a fall accident. Recently, wearable sensors –based FDSs have become the mainstream of research, which can be categorized into threshold-based FDSs using experience, machine learning –based FDSs using manual feature extraction, and deep learning (DL) –based FDSs using automatic feature extraction. However, most FDSs focus on the global information of sensor data, neglecting the fact that different segments of the data contribute variably to fall detection. This shortcoming makes it challenging for FDSs to accurately distinguish between similar human motion patterns of actual falls and fall-like actions, leading to a decrease in detection accuracy. Objective: This study aims to develop and validate a DL framework to accurately detect falls using acceleration and gyroscope data from wearable sensors. We aim to explore the essential contributing features extracted from sensor data to distinguish falls from activities of daily life. The significance of this study lies in reforming the FDS by designing a weighted feature representation using DL methods to effectively differentiate between fall events and fall-like activities. Methods: Based on the 3-axis acceleration and gyroscope data, we proposed a new DL architecture, the dual-stream convolutional neural network self-attention (DSCS) model. Unlike previous studies, the used architecture can extract global feature information from acceleration and gyroscope data. Additionally, we incorporated a self-attention module to assign different weights to the original feature vector, enabling the model to learn the contribution effect of the sensor data and enhance classification accuracy. The proposed model was trained and tested on 2 public data sets: SisFall and MobiFall. In addition, 10 participants were recruited to carry out practical validation of the DSCS model. A total of 1700 trials were performed to test the generalization ability of the model. Results: The fall detection accuracy of the DSCS model was 99.32% (recall=99.15%; precision=98.58%) and 99.65% (recall=100%; precision=98.39%) on the test sets of SisFall and MobiFall, respectively. In the ablation experiment, we compared the DSCS model with state-of-the-art machine learning and DL models. On the SisFall data set, the DSCS model achieved the second-best accuracy; on the MobiFall data set, the DSCS model achieved the best accuracy, recall, and precision. In practical validation, the accuracy of the DSCS model was 96.41% (recall=95.12%; specificity=97.55%). Conclusions: This study demonstrates that the DSCS model can significantly improve the accuracy of fall detection on 2 publicly available data sets and performs robustly in practical validation.
Tin-Han Chi and Kai-Chun Liu and Chia-Yeh Hsieh and Yu Tsao and Chia-Tai Chan. PREFALLKD: PRE-IMPACT FALL DETECTION VIA CNN-VIT KNOWLEDGE DISTILLATION. Index Terms-Inertial measurement units,Knowledge distillation,Pre-impact fall detection,Vision transformer,Wearable sensors, Fall accidents are critical issues in an aging and aged society. Recently, many researchers developed "pre-impact fall detection systems" using deep learning to support wearable-based fall protection systems for preventing severe injuries. However, most works only employed simple neural network models instead of complex models considering the usability in resource-constrained mobile devices and strict latency requirements. In this work, we propose a novel pre-impact fall detection via CNN-ViT knowledge distillation, namely PreFallKD, to strike a balance between detection performance and computational complexity. The proposed PreFallKD transfers the detection knowledge from the pre-trained teacher model (vision transformer) to the student model (lightweight convolutional neural networks). Additionally, we apply data augmentation techniques to tackle issues of data imbalance. We conduct the experiment on the KFall public dataset and compare PreFallKD with other state-of-the-art models. The experiment results show that PreFallKD could boost the student model during the testing phase and achieves reliable F1-score (92.66%) and lead time (551.3 ms).
{World Health Organization}. Ageing and health. Fact sheet, updated Oct.\ 1, 2024. October, 2024, https://www.who.int/news-room/fact-sheets/detail/ageing-and-health
{World Health Organization}. Falls. Fact sheet, updated Jan.\ 26, 2024. January, 2024, https://www.who.int/news-room/fact-sheets/detail/falls
P. J. Mercy and Sandhya K. Neelamana and Vijayan C. Parameswaran Nair (2024) Prevalence and risk factors for falls among the community dwelling older adults of Thrissur: A pilot study. Journal of Family Medicine and Primary Care 13: 875-880 https://doi.org/10.4103/jfmpc.jfmpc_2441_22, Medknow, 3, 3, 2249-4863
Amrit Pal Kaur and Ejay Nsugbe and Amy Drahota and Matthew Oldfield and Iman Mohagheghian and Radu A. Sporea (2025) State-of-the-art fall detection techniques with emphasis on floor-based systems —A review. Biomedical Engineering Advances 9: 100179 https://doi.org/10.1016/J.BEA.2025.100179, https://www.sciencedirect.com/science/article/pii/S2667099225000350?ref=pdf_download &fr=RR-2 &rr=97c281838fb43eff#page=10 &zoom=100,0,0, Elsevier, 6, 2667-0992, Abstract The Arthritis patients and aging population has challenged society to develop safer, independent living environments. Falls, associated injuries, and delays in fall treatment are major causes of morbidity and death in older adults. Therefore, fall detection systems are fundamental to reducing fall risks and building safer environments. Designing fall detection systems is an emerging field of research. The development of the system relies on a sensing mechanism, processing unit, and communication to alert the emergency facilities. Each module is crucial in providing a cost-effective, accurate, reliable, and robust solution. Technological advancements in fall detection systems, particularly wearable and non-wearable devices, offer promising solutions. Wearable systems are prevalent due to their cost-effectiveness and ease of installation, but they can be unreliable if not worn consistently. Non-wearable systems, including smart flooring, provide continuous monitoring but are expensive and complex to maintain. This article reviews the development and deployment of fall detection technologies, examining their practical limitations and emphasizing floor-based detection systems as a viable solution for fostering independent living among older adults.
Gomez, Diego and Medina, Mar ía and Rojas, Jes ús and Inga-Ortega, Juan (2024) Intelligent Elderly Monitoring System Using Computer Vision. 10.1109/COLCOM62950.2024.10720249, 1-6, 08
Adri án N ú ñez-Marcos and Ignacio Arganda-Carreras (2024) Transformer-based fall detection in videos. Engineering Applications of Artificial Intelligence 132 https://doi.org/10.1016/j.engappai.2024.107937, Elsevier Ltd, 6, Computer vision,Fall detection,Health,Transformer, 09521976, Falls pose a major threat for the elderly as they result in severe consequences for their physical and mental health or even death in the worst-case scenario. Nonetheless, the impact of falls can be alleviated with appropriate technological solutions. Fall detection is the task of recognising a fall, i.e. detecting when a person has fallen in a video. Such an algorithm can be implemented in lightweight devices which can then cater to the users ’ needs, e.g. alerting emergency services or caregivers. At the core of those systems, a model capable of promptly recognising falls is crucial for reducing the time until help comes. In this paper we propose a fall detection solution based on transformers, i.e. state-of-the-art neural networks for computer vision tasks. Our model takes a video clip and decides if a fall has occurred or not. In a video stream, it would be applied in a sliding-window fashion to trigger an alarm as soon as it detects a fall. We evaluate our fall detection backbone model on the large UP-Fall dataset, as well as on the UR fall dataset, and compare our results with existing literature using the former dataset.
Micheal Dutt and Aditya Gupta and Morten Goodwin and Christian W. Omlin (2024) An Interpretable Modular Deep Learning Framework for Video-Based Fall Detection. Applied Sciences (Switzerland) 14 https://doi.org/10.3390/app14114722, Multidisciplinary Digital Publishing Institute (MDPI), 6, OpenPose,classification module,computational efficiency,deep learning,fall detection,gradient-weighted class activation mapping,privacy preservation,real-time monitoring,short-time Fourier transform,video-based systems, 11, 20763417, Falls are a major risk factor for older adults, increasing morbidity and healthcare costs. Video-based fall-detection systems offer crucial real-time monitoring and assistance. Yet, their deployment faces challenges such as maintaining privacy, reducing false alarms, and providing understandable outputs for healthcare providers. This paper introduces an innovative automated fall-detection framework that includes a Gaussian blur module for privacy preservation, an OpenPose module for precise pose estimation, a short-time Fourier transform (STFT) module to capture frames with significant motion selectively, and a computationally efficient one-dimensional convolutional neural network (1D-CNN) classification module designed to classify these frames. Additionally, integrating a gradient-weighted class activation mapping (GradCAM) module enhances the system ’s explainability by visually highlighting the movement of the key points, resulting in classification decisions. Modular flexibility in our system allows customization to meet specific privacy and monitoring needs, enabling the activation or deactivation of modules according to the operational requirements of different healthcare settings. This combination of STFT and 1D-CNN ensures fast and efficient processing, which is essential in healthcare environments where real-time response and accuracy are vital. We validated our approach across multiple datasets, including the Multiple Cameras Fall Dataset (MCFD), the UR fall dataset, and the NTU RGB +D Dataset, which demonstrates high accuracy in detecting falls and provides the interpretability of results.
Jiangjiao Li and Mengqi Gao and Peng Wang and Bin Li (2024) Fall detection algorithm based on pyramid network and feature fusion. Evolving Systems 15: 1957-1970 https://doi.org/10.1007/s12530-024-09601-9, Springer Nature, 10, Fall detection,Feature fusion,Gate recurrent unit,Pyramid network,Transformer, 5, 18686486, Accidental falls are the second leading cause of accidental death of the elderly. Early intervention measures can reduce the problem. However, so far, there are few related studies using Transformer coding module for fall detection feature extraction, and the real-time performance of existing algorithms is not so good. Therefore, we propose a fall detection method based on Transformer to extract spatiotemporal features. Specifically, we use an image reduction module based on a convolutional neural network to reduce the image size for computation. Then, we design a pyramid network based on an improved Transformer to extract spatial features. Finally, we design a feature fusion module that fuses spatial features of different scales. The fused features are input into the gate recurrent unit to extract time features and complete the recognition of falls and normal postures. Experimental results show that the proposed approach achieves an accuracy of 99.61% and 99.33% when tested with UR Fall Detection Dataset and Le2i Fall Detection Dataset. Compared with the state-of-the-art fall detection algorithms, our method has high accuracy while maintaining high detection speed.
Awais Khan and Jung-Yeon Kim and Chomyong Kim and Muhammad Attique Khan and Hyojin Shin and Jiyoung Woo and Yunyoung Nam (2025) Human fall direction recognition in the indoor and outdoor environment using multi self-attention RBnet deep architectures and tree seed optimization. Scientific Reports 15 https://doi.org/10.1038/s41598-025-11031-9, Springer Science and Business Media LLC, 8, 1, 20452322
Ali Raza and Muhammad Haroon Yousaf and Waqar Ahmad and Sergio A. Velastin and Serestina Viriri (2025) Human fall detection using pose estimation: From traditional machine learning to vision transformers. Engineering Applications of Artificial Intelligence 143 https://doi.org/10.1016/j.engappai.2024.109809, Elsevier Ltd, 3, Deep learning,Fall detection,Human pose estimation,Machine learning, 09521976, Human activity recognition research for healthcare has drawn global attention in recent era. Recent advancements have led to various approaches capable of detecting diverse movements like walking, running, jumping, and falling. Fall detection is crucial due to its potential fatality, especially for older individuals. Sensors are widely employed to perceive environmental changes, and they can be integrated into wearable devices like phones, necklaces, or wristbands. However, these devices may be uncomfortable or unsuitable for continuous use. Video imagery, in principle, surpasses wearable sensors for fall detection. The proposed method uses video frames to identify falls, reducing the need for environmental sensors. We present an empirical analysis of vision-based human fall detection, employing multiple techniques to estimate human poses including a transformer-based pose estimation technique. These techniques yield foundational features used for training diverse networks, including machine learning classifiers to vision transformers. Our methodology achieves cutting-edge outcomes across the UR-Fall, UP-Fall, and Le2i fall detection datasets.
Mingze Ma and Xiaofeng Hu (2025) A deep learning and edge computing integrated approach for fall behavior detection in buildings. Journal of Safety Science and Resilience 6 https://doi.org/10.1016/j.jnlssr.2025.100218, KeAi Communications Co., 12, Building,Deep learning,Edge computing,Fall behavior detection,STGCN,YOLOv11, 4, 26664496, This study proposes a model integrating YOLOv11 and STGCN for accurate and real-time detection of fall behaviors within buildings. A specialized video dataset comprising fall behaviors performed by six volunteers was developed and used to validate the model's effectiveness in cloud computing and edge computing environments. The results obtained in the cloud computing environment were characterized by ample computational resources and the absence of real-time constraints. The model achieved precision, recall, and F1-score for fall behaviors exceeding 0.98. The model was integrated into edge computing devices in an actual test environment to directly process real-time video stream data. A missed detection rate of 18 % was observed on the Jetson ORIN NX 16GB device, while the Jetson AGX Orin 64GB recorded a lower missed detection rate of 15 %. Similarly, a false alarm rate of 16 % was observed on the Jetson ORIN NX 16GB device and 12 % on the Jetson AGX Orin 64GB device. These performance differences between the high-performance cloud computing cluster and edge computing devices, as well as among different edge computing devices, may be attributed to variations in computational resources, data quality, and device parameters. The results demonstrate the potential of the proposed model for real-time fall detection in resource-constrained environments.
Rahman, Syed Ashiqur and Adjeroh, Donald (2019) Centroid of Age Neighborhoods: A Generalized Approach to Estimate Biological Age. IEEE, 1--4, 2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI)
Bahdanau, Dzmitry and Cho, Kyunghyun and Bengio, Yoshua (2014) Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473
Imboden, Mary T and Harber, Matthew P and Whaley, Mitchell H and Finch, W Holmes and Bishop, Derron L and Kaminsky, Leonard A (2018) Cardiorespiratory fitness and mortality in healthy men and women. Journal of the American College of Cardiology 72(19): 2283--2292 Elsevier
Motiian, Saeid and Piccirilli, Marco and Adjeroh, Donald A and Doretto, Gianfranco (2017) Unified deep supervised domain adaptation and generalization. 5715--5725, Proceedings of the IEEE International Conference on Computer Vision
American College of Sports Medicine and others (2013) ACSM's guidelines for exercise testing and prescription. Lippincott Williams & Wilkins
Murphy, Kevin P (2012) Machine learning: A probabilistic perspective. MIT press
Ji, Shuiwang and Xu, Wei and Yang, Ming and Yu, Kai (2012) 3D convolutional neural networks for human action recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 35(1): 221--231 IEEE
Krizhevsky, Alex and Sutskever, Ilya and Hinton, Geoffrey E (2012) Imagenet classification with deep convolutional neural networks. 1097--1105, Advances in Neural Information Processing Systems
Horvath, Steve and Raj, Kenneth (2018) DNA methylation-based biomarkers and the epigenetic clock theory of ageing. Nature Reviews Genetics 19(6): 371 Nature Publishing Group
Pyrkov, Timothy V and Getmantsev, Evgeny and Zhurov, Boris and Avchaciov, Konstantin and Pyatnitskiy, Mikhail and Menshikov, Leonid and Khodova, Kristina and Gudkov, Andrei V and Fedichev, Peter O (2018) Quantitative characterization of biological age and frailty based on locomotor activity records. Aging (Albany NY) 10(10): 2973 Impact Journals, LLC
Wang, Zongwei and Tang, Xu and Luo, Weixin and Gao, Shenghua (2018) Face aging with identity-preserved conditional generative adversarial networks. 7939--7947, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
LeCun, Yann and Bengio, Yoshua and Hinton, Geoffrey (2015) Deep learning. Nature 521(7553): 436 Nature Publishing Group
Zhang, Ke and Liu, Na and Yuan, Xingfang and Guo, Xinyao and Gao, Ce and Zhao, Zhenbing (2018) Fine-Grained Age Estimation in the wild with Attention {LSTM} Networks. arXiv preprint arXiv:1805.10445
Rav{\`\i}, Daniele and Wong, Charence and Deligianni, Fani and Berthelot, Melissa and Andreu-Perez, Javier and Lo, Benny and Yang, Guang-Zhong (2016) Deep learning for health informatics. IEEE {J}ournal of {B}iomedical and {H}ealth {I}nformatics 21(1): 4--21 IEEE
Miotto, Riccardo and Wang, Fei and Wang, Shuang and Jiang, Xiaoqian and Dudley, Joel T (2017) Deep learning for healthcare: review, opportunities and challenges. Briefings in {B}ioinformatics 19(6): 1236--1246 Oxford University Press
Rahman, Syed Ashiqur and Adjeroh, Donald (2019) Centroid of Age Neighborhoods: A New Approach to Estimate Biological Age. IEEE {J}ournal of {B}iomedical and {H}ealth {I}nformatics IEEE
Rahman, Syed Ashiqur and Adjeroh, Donald A (2019) Deep Learning using Convolutional LSTM estimates Biological Age from Physical Activity. Scientific {R}eports 9(1): 1--15 Nature Publishing Group
Goodfellow, Ian and Pouget-Abadie, Jean and Mirza, Mehdi and Xu, Bing and Warde-Farley, David and Ozair, Sherjil and Courville, Aaron and Bengio, Yoshua (2014) Generative adversarial nets. 2672--2680, Advances in {N}eural {I}nformation {P}rocessing {S}ystems
Fu, Laiyi and Peng, Qinke and Chai, Ling (2019) Predicting DNA methylation states with hybrid information based deep-learning model. IEEE/ACM Transactions on Computational Biology and Bioinformatics IEEE
Liu, Feng and Li, Hao and Ren, Chao and Bo, Xiaochen and Shu, Wenjie (2016) PEDLA: predicting enhancers with a deep learning-based algorithmic framework. Scientific {R}eports 6: 28517 Nature Publishing Group
Cole, James H and Poudel, Rudra PK and Tsagkrasoulis, Dimosthenis and Caan, Matthan WA and Steves, Claire and Spector, Tim D and Montana, Giovanni (2017) Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker. NeuroImage 163: 115--124 Elsevier
Wang, Zichen and Li, Li and Glicksberg, Benjamin S and Israel, Ariel and Dudley, Joel T and Ma'ayan, Avi (2017) Predicting age by mining electronic medical records with deep learning characterizes differences between chronological and physiological age. Journal of Biomedical Informatics 76: 59--68 Elsevier
Hochschild, Richard (1989) Improving the precision of biological age determinations. Part 1: A new approach to calculating biological age. Experimental Gerontology 24(4): 289--300 Elsevier
Kr{\o}ll, J and Saxtrup, O (2000) On the use of regression analysis for the estimation of human biological age. Biogerontology 1(4): 363--368 Springer
Liu, Zuyun and Kuo, Pei-Lun and Horvath, Steve and Crimmins, Eileen and Ferrucci, Luigi and Levine, Morgan (2018) Phenotypic Age: A novel signature of mortality and morbidity risk. BioRxiv : 363291 Cold Spring Harbor Laboratory
Belsky, Daniel W and Moffitt, Terrie E and Cohen, Alan A and Corcoran, David L and Levine, Morgan E and Prinz, Joseph A and Schaefer, Jonathan and Sugden, Karen and Williams, Benjamin and Poulton, Richie and others (2017) Eleven telomere, epigenetic clock, and biomarker-composite quantifications of biological aging: do they measure the same thing?. American Journal of Epidemiology 187(6): 1220--1230 Oxford University Press
Mamoshina, Polina and Kochetov, Kirill and Putin, Evgeny and Cortese, Franco and Aliper, Alexander and Lee, Won-Suk and Ahn, Sung-Min and Uhn, Lee and Skjodt, Neil and Kovalchuk, Olga and others (2018) Population Specific Biomarkers of Human Aging: A Big Data Study Using {S}outh {K}orean, {C}anadian, and {E}astern {E}uropean Patient Populations. The Journals of Gerontology: Series A 73(11): 1482--1490 Oxford University Press US
Bobrov, Eugene and Georgievskaya, Anastasia and Kiselev, Konstantin and Sevastopolsky, Artem and Zhavoronkov, Alex and Gurov, Sergey and Rudakov, Konstantin and Tobar, Maria del Pilar Bonilla and Jaspers, S{\"o}ren and Clemann, Sven (2018) Photo{A}ge{C}lock: Deep learning algorithms for development of non-invasive visual biomarkers of aging. Aging (Albany NY) 10(11): 3249 Impact Journals, LLC
Zhavoronkov, Alex and Mamoshina, Polina and Vanhaelen, Quentin and Scheibye-Knudsen, Morten and Moskalev, Alexey and Aliper, Alex (2018) Artificial intelligence for aging and longevity research: Recent advances and perspectives. Ageing Research Reviews Elsevier
Srivastava, Nitish and Hinton, Geoffrey and Krizhevsky, Alex and Sutskever, Ilya and Salakhutdinov, Ruslan (2014) Dropout: A simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research 15(1): 1929--1958 JMLR. org
Dozat, Timothy (2016) Incorporating Nesterov Momentum into Adam. ICLR Workshop
M{\o}rkedal, Bj{\o}rn and Romundstad, P{\aa}l R and Vatten, Lars J (2011) Informativeness of indices of blood pressure, obesity and serum lipids in relation to ischaemic heart disease mortality: {T}he {HUNT-II} study. European Journal of Epidemiology 26(6): 457--461 Springer
Kingma, Diederik P and Ba, Jimmy (2014) Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980
Kaplan, Edward L and Meier, Paul (1958) Nonparametric estimation from incomplete observations. Journal of the American Statistical Association 53(282): 457--481 Taylor & Francis
DiPietro, L and Williamson, DF and Caspersen, CJ and Eaker, E (1993) The descriptive epidemiology of selected physical activities and body weight among adults trying to lose weight: the Behavioral Risk Factor Surveillance System survey, 1989.. International Journal of Obesity and Related Metabolic Disorders: Journal of the International Association for the Study of Obesity 17(2): 69--76
Stephens, Thomas and Caspersen, Carl J (1994) The demography of physical activity.. Physical Activity, Fitness, and Health: International Proceedings and Consensus Statement : 204--213 Champaign, IL, England: Human Kinetics Publishers
Caspersen, CJ and DiPietro, L (1991) National estimates of physical activity among older adults. Medicine and Science in Sports and Exercise 23(4 supplement): 106
DiPietro, Loretta (2001) Physical activity in aging: changes in patterns and their relationship to health and function. The Journals of Gerontology Series A: Biological Sciences and Medical Sciences 56(suppl\_2): 13--22 Oxford University Press
Guthold, Regina and Stevens, Gretchen A and Riley, Leanne M and Bull, Fiona C (2018) Worldwide trends in insufficient physical activity from 2001 to 2016: a pooled analysis of 358 population-based surveys with 1{\textperiodcentered} 9 million participants. The Lancet Global Health Elsevier
Glorot, Xavier and Bengio, Yoshua (2010) Understanding the difficulty of training deep feedforward neural networks. 249--256, Proceedings of the {T}hirteenth {I}nternational {C}onference on {A}rtificial {I}ntelligence and {S}tatistics
Kom, Edward L and Graubard, Barry I and Midthune, Douglas (1997) Time-to-event analysis of longitudinal follow-up of a survey: choice of the time-scale. American Journal of Epidemiology 145(1): 72--80 Oxford Univ Press
Cox, David Roxbee and Oakes, David (1984) Analysis of survival data. CRC Press, 21
Cevenini, E and Invidia, L and Lescai, Francesco and Salvioli, S and Tieri, P and Castellani, G and Franceschi, C (2008) Human models of aging and longevity. Expert opinion on biological therapy 8(9): 1393--1405 Taylor & Francis
Belsky, Daniel W and Caspi, Avshalom and Houts, Renate and Cohen, Harvey J and Corcoran, David L and Danese, Andrea and Harrington, HonaLee and Israel, Salomon and Levine, Morgan E and Schaefer, Jonathan D and others (2015) Quantification of biological aging in young adults. Proceedings of the National Academy of Sciences 112(30): E4104--E4110 National Acad Sciences
Mitnitski, Arnold and Howlett, Susan E and Rockwood, Kenneth (2016) Heterogeneity of human aging and its assessment. Journals of Gerontology Series A: Biomedical Sciences and Medical Sciences 72(7): 877--884 Oxford University Press US
Levine, Morgan E (2012) Modeling the rate of senescence: {C}an estimated biological age predict mortality more accurately than chronological age?. Journals of Gerontology Series A: Biomedical Sciences and Medical Sciences 68(6): 667--674 Oxford University Press US
Nakamura, E and Miyao, K and Ozeki, T (1988) Assessment of biological age by principal component analysis. Mechanisms of Ageing and Development 46(1): 1--18 Elsevier
Kr{\o}ll, J and Saxtrup, O (2000) On the use of regression analysis for the estimation of human biological age. Biogerontology 1(4): 363--368 Springer
Takeda, Hiroshi and Inada, Hiroshi and Inoue, Michitoshi and Yoshikawa, Hiromichi and Abe, Hiroshi (1982) Evaluation of biological age and physical age by multiple regression analysis. Medical Informatics 7(3): 221--227 Taylor & Francis
Xu, Cheng and Qu, Hongzhu and Wang, Guangyu and Xie, Bingbing and Shi, Yi and Yang, Yaran and Zhao, Zhao and Hu, Lan and Fang, Xiangdong and Yan, Jiangwei and others (2015) A novel strategy for forensic age prediction by {DNA} methylation and support vector regression model. Scientific Reports 5Nature Publishing Group
Guo, Guodong and Mu, Guowang and Fu, Yun and Huang, Thomas S (2009) Human age estimation using bio-inspired features. IEEE, 112--119, IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Fu, Yun and Huang, Thomas S (2008) Human age estimation with regression on discriminative aging manifold. IEEE Transactions on Multimedia 10(4): 578--584 IEEE
A
Horvath, Steve (2013) {DNA} methylation age of human tissues and cell types. Genome Biology 14(10): 3156 BioMed Central
Cole, JH and Ritchie, SJ and Bastin, ME and Hern{\'a}ndez MC Vald{\'e}s and Maniega, S Mu{\ n}oz and Royle, N and Corley, J and Pattie, A and Harris, SE and Zhang, Q and others (2017) Brain age predicts mortality. Molecular Psychiatry Nature Publishing Group
Sebastiani, Paola and Thyagarajan, Bharat and Sun, Fangui and Schupf, Nicole and Newman, Anne B and Montano, Monty and Perls, Thomas T (2017) Biomarker signatures of aging. Aging Cell 16(2): 329--338 Wiley Online Library
Piccirilli, Marco and Doretto, Gianfranco and Adjeroh, Donald (2017) {A} {F}ramework for {A}nalyzing the {W}hole {B}ody {S}urface {A}rea from a {S}ingle {V}iew. PLoS ONE 12(1): e0166749 Public Library of Science
Shamir, Lior (2011) Automatic age estimation by hand photos. Computer Science Letters 3(1)
Aoyagi, Kiyoshi and Ross, Philip D and Nevitt, Michael C and Davis, James W and Wasnich, Richard D and Hayashi, Takuo and Takemoto, Tai-ichiro (2001) Comparison of performance-based measures among native {J}apanese, {J}apanese-{A}mericans in {H}awaii and {C}aucasian women in the {U}nited {S}tates, ages 65 years and over: {A} cross-sectional study. BMC Geriatrics 1(1): 1 BioMed Central
Chilima, Dorothy M and Ismail, Suraiya J (2001) Nutrition and handgrip strength of older adults in rural {M}alawi. Public Health Nutrition 4(01): 11--17 Cambridge Univ Press
Vaz, M and Hunsberger, S and Diffey, B (2002) Prediction equations for handgrip strength in healthy {I}ndian male and female subjects encompassing a wide age range. Annals of Human Biology 29(2): 131--141 Taylor & Francis
Koley, Shyamal and Kaur, Navdeep and Sandhu, JS (2009) A study on hand grip strength in female labourers of {J}alandhar, {P}unjab, {I}ndia. J. Life Sci 1(1): 57--62
Sch{\"o}lkopf, Bernhard and Smola, Alexander and M{\"u}ller, Klaus-Robert (1997) {K}ernel {P}rincipal {C}omponent {A}nalysis. Springer, 583--588, International Conference on Artificial Neural Networks
Jolliffe, Ian (2002) {P}rincipal {C}omponent {A}nalysis. Wiley Online Library
Al Snih, Soham and Markides, Kyriakos S and Ray, Laura and Ostir, Glenn V and Goodwin, James S (2002) Handgrip strength and mortality in older {M}exican {A}mericans. Journal of the American Geriatrics Society 50(7): 1250--1256 Wiley Online Library
Ali, Naeem A and O'Brien Jr, James M and Hoffmann, Stephen P and Phillips, Gary and Garland, Allan and Finley, James CW and Almoosa, Khalid and Hejal, Rana and Wolf, Karen M and Lemeshow, Stanley and others (2008) Acquired weakness, handgrip strength, and mortality in critically ill patients. American Journal of Respiratory and Critical Care Medicine 178(3): 261--268 Am Thoracic Soc
Ling, Carolina HY and Taekema, Diana and de Craen, Anton JM and Gussekloo, Jacobijn and Westendorp, Rudi GJ and Maier, Andrea B (2010) Handgrip strength and mortality in the oldest old population: {T}he {L}eiden 85-plus study. Canadian Medical Association Journal 182(5): 429--435 Can Med Assoc
Bocklandt, Sven and Lin, Wen and Sehl, Mary E and S{\'a}nchez, Francisco J and Sinsheimer, Janet S and Horvath, Steve and Vilain, Eric (2011) Epigenetic predictor of age. PLoS ONE 6(6): e14821 Public Library of Science
Ramanathan, Narayanan and Chellappa, Rama (2006) Modeling age progression in young faces. IEEE, 387--394, 1, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06)
Campisi, Judith (2003) Cellular senescence and cell death. Physiological Basis of Aging and Geriatrics : 47--59
Campisi, J (1997) The biology of replicative senescence. European Journal of Cancer 33(5): 703--709 Elsevier
Richter, Christoph and Park, Jeen-Woo and Ames, Bruce N (1988) Normal oxidative damage to mitochondrial and nuclear {DNA} is extensive. Proceedings of the National Academy of Sciences 85(17): 6465--6467 National Acad Sciences
Harman, Denham (2003) The free radical theory of aging. Antioxidants and Redox Signaling 5(5): 557--561 Mary Ann Liebert, Inc.
Tosato, Matteo and Zamboni, Valentina and Ferrini, Alessandro and Cesari, Matteo (2007) The aging process and potential interventions to extend life expectancy. Clinical Interventions in Aging 2(3): 401 DOVE MEDICAL PRESS
Fischer, Krista and Kettunen, Johannes and W{\"u}rtz, Peter and Haller, Toomas and Havulinna, Aki S and Kangas, Antti J and Soininen, Pasi and Esko, T{\ o}nu and Tammesoo, Mari-Liis and M{\"a}gi, Reedik and others (2014) Biomarker profiling by nuclear magnetic resonance spectroscopy for the prediction of all-cause mortality: {A}n observational study of 17,345 persons. PLoS Med 11(2): e1001606 Public Library of Science
Ritz-Timme, S and Cattaneo, C and Collins, MJ and Waite, ER and Sch{\"u}tz, HW and Kaatsch, H-J and Borrman, HIM (2000) Age estimation: {T}he state of the art in relation to the specific demands of forensic practise. International Journal of Legal Medicine 113(3): 129--136 Springer
Anstey, Kaarin J and Lord, Stephen R and Smith, Glen A (1996) Measuring human functional age: {A} review of empirical findings. Experimental Aging Research 22(3): 245--266 Taylor & Francis
Kr{\"o}ger, Heikki and Kotaniemi, Antero and Kr{\"o}ger, Liisa and Alhava, Esko (1993) Development of bone mass and bone density of the spine and femoral neck —a prospective study of 65 children and adolescents. Bone and Mineral 23(3): 171--182 Elsevier
Krakauer, Nir Y and Krakauer, Jesse C (2012) A new body shape index predicts mortality hazard independently of {B}ody {M}ass {I}ndex. PLoS ONE 7(7): e39504 Public Library of Science
Geng, Xin and Zhou, Zhi-Hua and Smith-Miles, Kate (2007) Automatic age estimation based on facial aging patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(12): 2234--2240 IEEE
Williams, Guy (2001) A review of the most commonly used dental age estimation techniques. J Forensic Odontostomatol 19: 9--17
Fu, Yun and Guo, Guodong and Huang, Thomas S (2010) Age synthesis and estimation via faces: {A} survey. IEEE Transactions on Pattern Analysis and Machine Intelligence 32(11): 1955--1976 IEEE
Han, Hu and Otto, Charles and Liu, Xiaoming and Jain, Anil K (2015) Demographic estimation from face images: Human vs. machine performance. IEEE Transactions on Pattern Analysis and Machine Intelligence 37(6): 1148--1161 IEEE
Chua, Caroline and Cote, Joseph A and Leong, Siew Meng (1990) The antecedents of cognitive age. NA-Advances in Consumer Research Volume 17
House, James S and Lepkowski, James M and Kinney, Ann M and Mero, Richard P and Kessler, Ronald C and Herzog, A Regula (1994) The social stratification of aging and health. Journal of Health and Social Behavior : 213--234 JSTOR
Schmeling, A and Grundmann, C and Fuhrmann, A and Kaatsch, H-J and Knell, B and Ramsthaler, F and Reisinger, W and Riepert, T and Ritz-Timme, S and R{\"o}sing, FW and others (2008) Criteria for age estimation in living individuals. International Journal of Legal Medicine 122(6): 457--460 Springer
Maddox, George L and Douglass, Elizabeth B (1973) Self-assessment of health: {A} longitudinal study of elderly subjects. Journal of Health and Social Behavior : 87--93 JSTOR
Nurmi, Jari-Erik (1987) Age, sex, social class, and quality of family interaction as determinants of adolescents' future orientation: {A} developmental task interpretation. Adolescence 22(88): 977 Libra Publishers.
Rahman, Syed Ashiqur and Adjeroh, Donald (2015) {S}urface-{B}ased {B}ody {S}hape {I}ndex and {I}ts {R}elationship with {A}ll-{C}ause {M}ortality. PLoS ONE 10(12): e0144639 Public Library of Science
Sanderson, Warren C and Scherbov, Sergei (2014) Measuring the speed of aging across population subgroups. PLoS ONE 9(5): e96289 Public Library of Science
Massy-Westropp, Nicola M and Gill, Tiffany K and Taylor, Anne W and Bohannon, Richard W and Hill, Catherine L (2011) Hand Grip Strength: {A}ge and gender stratified normative data in a population-based study. BMC Research Notes 4(1): 127 BioMed Central Ltd
Xia, Baiqiang and Amor, Boulbaba Ben and Daoudi, Mohamed and Drira, Hassen (2014) Can 3{D} shape of the face reveal your age?. IEEE, 5--13, 2, International Conference on Computer Vision Theory and Applications (VISAPP)
Guo, Guodong and Mu, Guowang and Fu, Yun and Dyer, Charles R and Huang, Thomas S (2009) A study on automatic age estimation using a large database.. 1986--1991, ICCV
Egilmez, Hilmi E and Ortega, Antonio (2014) Spectral anomaly detection using graph-based filtering for wireless sensor networks. IEEE, 1085--1089, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014
Belkin, Mikhail and Niyogi, Partha (2003) Laplacian eigenmaps for dimensionality reduction and data representation. Neural Computation 15(6): 1373--1396 MIT Press
Klemera, Petr and Doubal, Stanislav (2006) A new approach to the concept and computation of biological age. Mechanisms of Ageing and Development 127(3): 240--248 Elsevier
Jackson, Stephen HD and Weale, Martin R and Weale, Robert A (2003) Biological age —what is it and can it be measured?. Archives of Gerontology and Geriatrics 36(2): 103--115 Elsevier
Han, Hu and Otto, Christina and Jain, Anubhav K (2013) Age estimation from face images: {H}uman vs. machine performance. IEEE, 1--8, Biometrics (ICB), 2013 International Conference on
Eikvil, Line and Kvaal, Sigrid Ingeborg and Teigland, Andr{\'e} and Haugen, Marion and Gr{\o}gaard, Jens (2012) Age estimation in youths and young adults. Norsk Regnesentral 12Citeseer
Cho, Il Haeng and Park, Kyung S and Lim, Chang Joo (2010) An empirical comparative study on biological age estimation algorithms with an application of {W}ork {A}bility {I}ndex ({WAI}). Mechanisms of Ageing and Development 131(2): 69--78 Elsevier
Putin, Evgeny and Mamoshina, Polina and Aliper, Alexander and Korzinkin, Mikhail and Moskalev, Alexey and Kolosov, Alexey and Ostrovskiy, Alexander and Cantor, Charles and Vijg, Jan and Zhavoronkov, Alex (2016) Deep biomarkers of human aging: {A}pplication of deep neural networks to biomarker development. Aging 8(5): 1--021
Rhodes, Matthew G (2009) Age estimation of faces: {A} review. Applied Cognitive Psychology 23(1): 1--12 Wiley Online Library
Alley, Thomas R (2013) {S}ocial and {A}pplied {A}spects of {P}erceiving {F}aces. Psychology Press
Mughal, Arsalan Manzoor and Hassan, Nuzhat and Ahmed, Anwar (2014) Bone age assessment methods: {A} critical review. Pakistan journal of medical sciences 30(1): 211 Professional Medical Publications
Zerilli, A and Baccino, E and Lamendin, H and Tavernier, JC and Humbert, JF and Nossintchouk, RM (1992) A simple technique for age estimation in adult corpses: {T}he two criteria dental method. Journal of Forensic Science 37(5): 1373--1379 ASTM International
Basak, Debasish and Pal, Srimanta and Patranabis, Dipak Chandra (2007) {S}upport {V}ector {R}egression. Neural Information Processing-Letters and Reviews 11(10): 203--224 Citeseer
Liaw, Andy and Wiener, Matthew (2002) Classification and {R}egression by random{F}orest. R news 2(3): 18--22
Maber, M and Liversidge, HM and Hector, MP (2006) Accuracy of age estimation of radiographic methods using developing teeth. Forensic Science International 159: S68--S73 Elsevier
Lu, Jiwen and Tan, Yap-Peng (2010) Gait-based human age estimation. IEEE Transactions on Information Forensics and Security 5(4): 761--770 IEEE
Wang, Xiaolong and Guo, Rui and Kambhamettu, Chandra (2015) Deeply-learned feature for age estimation. IEEE, 534--541, 2015 IEEE Winter Conference on Applications of Computer Vision
Guo, Guodong and Fu, Yun and Huang, Thomas S and Dyer, Charles R (2008) Locally adjusted robust regression for human age estimation. IEEE Workshop on Applications of Computer Vision 51: 61801
Eipel, Monika and Mayer, Felix and Arent, Tanja and Ferreira, Marcelo RP and Birkhofer, Carina and Gerstenmaier, Uwe and Costa, Ivan G and Ritz-Timme, Stefanie and Wagner, Wolfgang (2016) Epigenetic age predictions based on buccal swabs are more precise in combination with cell type-specific {DNA} methylation signatures. Aging 8(5)
Guo, Guodong and Mu, Guowang (2011) Simultaneous dimensionality reduction and human age estimation via kernel partial least squares regression. IEEE, 657--664, Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
Ni, Bingbing and Song, Zheng and Yan, Shuicheng (2009) Web image mining towards universal age estimator. ACM, 85--94, Proceedings of the 17th ACM International Conference on Multimedia
Geng, Xin and Yin, Chao and Zhou, Zhi-Hua (2013) Facial age estimation by learning from label distributions. IEEE Transactions on Pattern Analysis and Machine Intelligence 35(10): 2401--2412 IEEE
Robinette, Kathleen M and Daanen, Hans and Paquet, Eric (1999) The CAESAR project: {A} 3-{D} surface anthropometry survey. IEEE, 380--386, 3-D Digital Imaging and Modeling, 1999. Proceedings. Second International Conference on
Adjeroh, Donald and Cao, Deng and Piccirilli, Marco and Ross, Arun (2010) Predictability and correlation in human metrology. IEEE, 1--6, Information Forensics and Security (WIFS), 2010 IEEE International Workshop on
Cao, Deng and Chen, Cunjian and Adjeroh, Donald and Ross, Arun (2012) Predicting gender and weight from human metrology using a copula model. IEEE, 162--169, Biometrics: Theory, Applications and Systems (BTAS), 2012 IEEE Fifth International Conference on
Box, George EP and Cox, David R (1964) An analysis of transformations. Journal of the Royal Statistical Society. Series B (Methodological) : 211--252 JSTOR
Xingjian, SHI and Chen, Zhourong and Wang, Hao and Yeung, Dit-Yan and Wong, Wai-Kin and Woo, Wang-chun (2015) Convolutional LSTM network: A machine learning approach for precipitation nowcasting. 802--810, Advances in Neural Information Processing Systems
Hochreiter, Sepp and Schmidhuber, J{\"u}rgen (1997) Long short-term memory. Neural Computation 9(8): 1735--1780 MIT Press
Pyrkov, Timothy V and Slipensky, Konstantin and Barg, Mikhail and Kondrashin, Alexey and Zhurov, Boris and Zenin, Alexander and Pyatnitskiy, Mikhail and Menshikov, Leonid and Markov, Sergei and Fedichev, Peter O (2018) Extracting biological age from biomedical data via deep learning: too much of a good thing?. Scientific Reports 8(1): 5210 Nature Publishing Group
Altmann, Andr{\'e} and Tolo{\c{s}}i, Laura and Sander, Oliver and Lengauer, Thomas (2010) Permutation importance: {A} corrected feature importance measure. Bioinformatics 26(10): 1340--1347 Oxford University Press
Hamano, Yuya and Manabe, Sho and Morimoto, Chie and Fujimoto, Shuntaro and Tamaki, Keiji (2017) Forensic age prediction for saliva samples using methylation-sensitive high resolution melting: exploratory application for cigarette butts. Scientific {R}eports 7(1): 10444 Nature Publishing Group
Xu, Cheng and Qu, Hongzhu and Wang, Guangyu and Xie, Bingbing and Shi, Yi and Yang, Yaran and Zhao, Zhao and Hu, Lan and Fang, Xiangdong and Yan, Jiangwei and others (2015) A novel strategy for forensic age prediction by {DNA} methylation and support vector regression model. Scientific {R}eports 5Nature Publishing Group
Piccirilli, Marco and Doretto, Gianfranco and Adjeroh, Donald (2017) {A} {F}ramework for {A}nalyzing the {W}hole {B}ody {S}urface {A}rea from a {S}ingle {V}iew. PLoS ONE 12(1): e0166749 Public Library of Science
Shamir, Lior (2011) Automatic age estimation by hand photos. Computer Science Letters 3(1)
Aoyagi, Kiyoshi and Ross, Philip D and Nevitt, Michael C and Davis, James W and Wasnich, Richard D and Hayashi, Takuo and Takemoto, Tai-ichiro (2001) Comparison of performance-based measures among native {J}apanese, {J}apanese-{A}mericans in {H}awaii and {C}aucasian women in the {U}nited {S}tates, ages 65 years and over: {A} cross-sectional study. BMC Geriatrics 1(1): 1 BioMed Central
Chilima, Dorothy M and Ismail, Suraiya J (2001) Nutrition and handgrip strength of older adults in rural {M}alawi. Public Health Nutrition 4(01): 11--17 Cambridge Univ Press
Vaz, M and Hunsberger, S and Diffey, B (2002) Prediction equations for handgrip strength in healthy {I}ndian male and female subjects encompassing a wide age range. Annals of Human Biology 29(2): 131--141 Taylor & Francis
Koley, Shyamal and Kaur, Navdeep and Sandhu, JS (2009) A study on hand grip strength in female labourers of {J}alandhar, {P}unjab, {I}ndia. J. Life Sci 1(1): 57--62
Sch{\"o}lkopf, Bernhard and Smola, Alexander and M{\"u}ller, Klaus-Robert (1997) {K}ernel {P}rincipal {C}omponent {A}nalysis. Springer, 583--588, International Conference on Artificial Neural Networks
Jolliffe, Ian (2002) {P}rincipal {C}omponent {A}nalysis. Wiley Online Library
Al Snih, Soham and Markides, Kyriakos S and Ray, Laura and Ostir, Glenn V and Goodwin, James S (2002) Handgrip strength and mortality in older {M}exican {A}mericans. Journal of the American Geriatrics Society 50(7): 1250--1256 Wiley Online Library
Ali, Naeem A and O'Brien Jr, James M and Hoffmann, Stephen P and Phillips, Gary and Garland, Allan and Finley, James CW and Almoosa, Khalid and Hejal, Rana and Wolf, Karen M and Lemeshow, Stanley and others (2008) Acquired weakness, handgrip strength, and mortality in critically ill patients. American Journal of Respiratory and Critical Care Medicine 178(3): 261--268 Am Thoracic Soc
Ling, Carolina HY and Taekema, Diana and de Craen, Anton JM and Gussekloo, Jacobijn and Westendorp, Rudi GJ and Maier, Andrea B (2010) Handgrip strength and mortality in the oldest old population: {T}he {L}eiden 85-plus study. Canadian Medical Association Journal 182(5): 429--435 Can Med Assoc
Bocklandt, Sven and Lin, Wen and Sehl, Mary E and S{\'a}nchez, Francisco J and Sinsheimer, Janet S and Horvath, Steve and Vilain, Eric (2011) Epigenetic predictor of age. PLoS ONE 6(6): e14821 Public Library of Science
Ramanathan, Narayanan and Chellappa, Rama (2006) Modeling age progression in young faces. IEEE, 387--394, 1, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06)
Campisi, Judith (2003) Cellular senescence and cell death. Physiological Basis of Aging and Geriatrics : 47--59
Campisi, J (1997) The biology of replicative senescence. European Journal of Cancer 33(5): 703--709 Elsevier
Richter, Christoph and Park, Jeen-Woo and Ames, Bruce N (1988) Normal oxidative damage to mitochondrial and nuclear {DNA} is extensive. Proceedings of the National Academy of Sciences 85(17): 6465--6467 National Acad Sciences
Harman, Denham (2003) The free radical theory of aging. Antioxidants and Redox Signaling 5(5): 557--561 Mary Ann Liebert, Inc.
Tosato, Matteo and Zamboni, Valentina and Ferrini, Alessandro and Cesari, Matteo (2007) The aging process and potential interventions to extend life expectancy. Clinical Interventions in Aging 2(3): 401 DOVE MEDICAL PRESS
Ritz-Timme, S and Cattaneo, C and Collins, MJ and Waite, ER and Sch{\"u}tz, HW and Kaatsch, H-J and Borrman, HIM (2000) Age estimation: {T}he state of the art in relation to the specific demands of forensic practise. International Journal of Legal Medicine 113(3): 129--136 Springer
Kr{\"o}ger, Heikki and Kotaniemi, Antero and Kr{\"o}ger, Liisa and Alhava, Esko (1993) Development of bone mass and bone density of the spine and femoral neck —a prospective study of 65 children and adolescents. Bone and Mineral 23(3): 171--182 Elsevier
Geng, Xin and Zhou, Zhi-Hua and Smith-Miles, Kate (2007) Automatic age estimation based on facial aging patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(12): 2234--2240 IEEE
Williams, Guy (2001) A review of the most commonly used dental age estimation techniques. J Forensic Odontostomatol 19: 9--17
Fu, Yun and Guo, Guodong and Huang, Thomas S (2010) Age synthesis and estimation via faces: {A} survey. IEEE Transactions on Pattern Analysis and Machine Intelligence 32(11): 1955--1976 IEEE
Chua, Caroline and Cote, Joseph A and Leong, Siew Meng (1990) The antecedents of cognitive age. NA-Advances in Consumer Research Volume 17
House, James S and Lepkowski, James M and Kinney, Ann M and Mero, Richard P and Kessler, Ronald C and Herzog, A Regula (1994) The social stratification of aging and health. Journal of Health and Social Behavior : 213--234 JSTOR
Schmeling, A and Grundmann, C and Fuhrmann, A and Kaatsch, H-J and Knell, B and Ramsthaler, F and Reisinger, W and Riepert, T and Ritz-Timme, S and R{\"o}sing, FW and others (2008) Criteria for age estimation in living individuals. International Journal of Legal Medicine 122(6): 457--460 Springer
Maddox, George L and Douglass, Elizabeth B (1973) Self-assessment of health: {A} longitudinal study of elderly subjects. Journal of Health and Social Behavior : 87--93 JSTOR
Nurmi, Jari-Erik (1987) Age, sex, social class, and quality of family interaction as determinants of adolescents' future orientation: {A} developmental task interpretation. Adolescence 22(88): 977 Libra Publishers.
Sanderson, Warren C and Scherbov, Sergei (2014) Measuring the speed of aging across population subgroups. PLoS ONE 9(5): e96289 Public Library of Science
Massy-Westropp, Nicola M and Gill, Tiffany K and Taylor, Anne W and Bohannon, Richard W and Hill, Catherine L (2011) Hand Grip Strength: {A}ge and gender stratified normative data in a population-based study. BMC Research Notes 4(1): 127 BioMed Central Ltd
Xia, Baiqiang and Amor, Boulbaba Ben and Daoudi, Mohamed and Drira, Hassen (2014) Can 3{D} shape of the face reveal your age?. IEEE, 5--13, 2, International Conference on Computer Vision Theory and Applications (VISAPP)
Guo, Guodong and Mu, Guowang and Fu, Yun and Dyer, Charles R and Huang, Thomas S (2009) A study on automatic age estimation using a large database.. 1986--1991, ICCV
Egilmez, Hilmi E and Ortega, Antonio (2014) Spectral anomaly detection using graph-based filtering for wireless sensor networks. IEEE, 1085--1089, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014
Belkin, Mikhail and Niyogi, Partha (2003) Laplacian eigenmaps for dimensionality reduction and data representation. Neural Computation 15(6): 1373--1396 MIT Press
Han, Hu and Otto, Christina and Jain, Anubhav K (2013) Age estimation from face images: {H}uman vs. machine performance. IEEE, 1--8, Biometrics (ICB), 2013 International Conference on
Eikvil, Line and Kvaal, Sigrid Ingeborg and Teigland, Andr{\'e} and Haugen, Marion and Gr{\o}gaard, Jens (2012) Age estimation in youths and young adults. Norsk Regnesentral 12Citeseer
Rhodes, Matthew G (2009) Age estimation of faces: {A} review. Applied Cognitive Psychology 23(1): 1--12 Wiley Online Library
Alley, Thomas R (2013) {S}ocial and {A}pplied {A}spects of {P}erceiving {F}aces. Psychology Press
Mughal, Arsalan Manzoor and Hassan, Nuzhat and Ahmed, Anwar (2014) Bone age assessment methods: {A} critical review. Pakistan journal of medical sciences 30(1): 211 Professional Medical Publications
Zerilli, A and Baccino, E and Lamendin, H and Tavernier, JC and Humbert, JF and Nossintchouk, RM (1992) A simple technique for age estimation in adult corpses: {T}he two criteria dental method. Journal of Forensic Science 37(5): 1373--1379 ASTM International
Basak, Debasish and Pal, Srimanta and Patranabis, Dipak Chandra (2007) {S}upport {V}ector {R}egression. Neural Information Processing-Letters and Reviews 11(10): 203--224 Citeseer
Liaw, Andy and Wiener, Matthew (2002) Classification and {R}egression by random{F}orest. R news 2(3): 18--22
Maber, M and Liversidge, HM and Hector, MP (2006) Accuracy of age estimation of radiographic methods using developing teeth. Forensic Science International 159: S68--S73 Elsevier
Lu, Jiwen and Tan, Yap-Peng (2010) Gait-based human age estimation. IEEE Transactions on Information Forensics and Security 5(4): 761--770 IEEE
Wang, Xiaolong and Guo, Rui and Kambhamettu, Chandra (2015) Deeply-learned feature for age estimation. IEEE, 534--541, 2015 IEEE Winter Conference on Applications of Computer Vision
Guo, Guodong and Fu, Yun and Huang, Thomas S and Dyer, Charles R (2008) Locally adjusted robust regression for human age estimation. IEEE Workshop on Applications of Computer Vision 51: 61801
Eipel, Monika and Mayer, Felix and Arent, Tanja and Ferreira, Marcelo RP and Birkhofer, Carina and Gerstenmaier, Uwe and Costa, Ivan G and Ritz-Timme, Stefanie and Wagner, Wolfgang (2016) Epigenetic age predictions based on buccal swabs are more precise in combination with cell type-specific {DNA} methylation signatures. Aging 8(5)
Guo, Guodong and Mu, Guowang (2011) Simultaneous dimensionality reduction and human age estimation via kernel partial least squares regression. IEEE, 657--664, Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
Ni, Bingbing and Song, Zheng and Yan, Shuicheng (2009) Web image mining towards universal age estimator. ACM, 85--94, Proceedings of the 17th ACM International Conference on Multimedia
Geng, Xin and Yin, Chao and Zhou, Zhi-Hua (2013) Facial age estimation by learning from label distributions. IEEE Transactions on Pattern Analysis and Machine Intelligence 35(10): 2401--2412 IEEE
Robinette, Kathleen M and Daanen, Hans and Paquet, Eric (1999) The CAESAR project: {A} 3-{D} surface anthropometry survey. IEEE, 380--386, 3-D Digital Imaging and Modeling, 1999. Proceedings. Second International Conference on
Adjeroh, Donald and Cao, Deng and Piccirilli, Marco and Ross, Arun (2010) Predictability and correlation in human metrology. IEEE, 1--6, Information Forensics and Security (WIFS), 2010 IEEE International Workshop on
Cao, Deng and Chen, Cunjian and Adjeroh, Donald and Ross, Arun (2012) Predicting gender and weight from human metrology using a copula model. IEEE, 162--169, Biometrics: Theory, Applications and Systems (BTAS), 2012 IEEE Fifth International Conference on
A
Acknowledgement
The authors would like to express their sincere appreciation to the Universidad de San Buenaventura, Cali, Colombia, for its continuous institutional support and commitment to fostering research and innovation.
Additional Files
Additional file 14
Click here to Correct
Additional file 15
Click here to Correct
Additional file 16
Click here to Correct
Additional file 17
Click here to Correct
Additional file 18
Click here to Correct
Additional file 19
Click here to Correct
Additional file 21
Click here to Correct
Additional file 22
Click here to Correct
Additional file 23
Click here to Correct
Additional file 24
Click here to Correct
Additional file 25
Click here to Correct
Additional file 26
Click here to Correct
Additional file 27
Click here to Correct
Total words in MS: 4333
Total words in Title: 14
Total words in Abstract: 120
Total Keyword count: 6
Total Images in MS: 9
Total Tables in MS: 0
Total Reference count: 210