Abstract
Background Pelvic Incidence (PI) plays a crucial role in surgical planning. However, it is insufficient for accurately predicting spinal alignment parameters, including Sacral Slope, Pelvic Tilt, and Lumbar Lordosis. We have devised an AI-based method for predicting sagittal spinal alignments with enhanced precision.
Methods We have developed an AI-based system utilizing a Seq2Seq framework to model the spatial correlation between pelvic and spinal key points. This system was trained on a dataset of 337 cases and evaluated using 51 cases obtained from a multi-centre hospital. To address the issue of pelvic rotation, we introduced an Angle Correlation Network. We compared the performance of our AI-based system in predicting spinal alignment against the traditional PI-based method. This comparison was conducted using Mean Absolute Error (MAE) and the Correlation Coefficient (R value) as evaluation metrics.
Results We evaluated the performance of our AI-based system for predicting Sacral Slope (SS), Pelvic Tilt (PT), and Lumbar Lordosis (LL) values. The Pearson correlation coefficient of the AI-based method surpassed that of the PI-based method (0.80 vs 0.67 for SS, 0.73 vs 0.52 for PT, and 0.76 vs 0.48 for LL), indicating a more robust linear relationship between AI predictions and actual values. Additionally, the AI-based method exhibited a lower Mean Absolute Error (MAE) compared to the PI-based method for LL (5.52 vs 6.69), signifying enhanced prediction accuracy.
Conclusions In this study, we demonstrated the potential of an AI-based approach for predicting sagittal spinal alignments with improved precision compared to the traditional PI-based method. The AI-based system, utilizing a Seq2Seq framework and an Angle Correlation Network, exhibited a stronger linear relationship between predicted and actual values for Sacral Slope, Pelvic Tilt, and Lumbar Lordosis, as well as a reduced Mean Absolute Error for Lumbar Lordosis. These findings support the integration of AI in spinal surgery planning and personalized medicine for sagittal alignment evaluation and management.
Introduction
Bipedalism in Homo sapiens has led to the development of extraordinary sagittal spinopelvic alignments, which enable an energetically economical upright stance due to the reciprocal sagittal curvatures [1]. A healthy, physically mature individual typically possesses a unique spinopelvic alignment characterized by the harmony among pelvic, lumbar, thoracic, and cervical segments. Among these segments, the pelvis lacks movable intervertebral discs and remains relatively constant, even when spinal segments undergo malformation. Consequently, the pelvic sequence can be used to predict original spinal alignments due to its correlation with other spinal segments and invariable morphology [2-17].
Pelvic incidence (PI), defined as the angle between the line perpendicular to the superior plate of S1 at its midpoint and the line connecting this point to the axis of the femoral heads, has been investigated as an invariant morphological parameter of the pelvis [2]. PI has been shown to have strong correlations with pelvic tilt (PT), sacral slope (SS), and lumbar lordosis (LL). As a result, PI is widely used to predict the original spinopelvic alignments of patients with spinal deformity and inform surgical planning [2-17].
However, the current method has several limitations. Firstly, PI is a simplistic graphical angle, providing minimal information about the pelvic sequence and neglecting potentially crucial aspects of pelvic morphology. Secondly, using PI to predict the sagittal sequence of the spine is overly general and lacks specificity, rendering it impossible to discern local spinal sequences. Thirdly, all predictive sequence information is non-visible. Owing to these limitations, the existing method for predicting spinal sagittal alignments is insufficient for surgical design requirements.
Machine learning, particularly deep learning, has demonstrated superiority over human intelligence and has been successfully applied to medical data for clinical purposes [17-20]. Therefore, in this study, we utilized AI technology to conduct correlation analyses of pelvic and spinal sequences, introducing an innovative method for predicting sagittal spinal alignments. Our approach aims to achieve higher accuracy and more direct-viewing information than previous studies.
Method
1. Building an AI-based system to model spatial correlation between pelvic and spinal key points
In this study, we introduce a novel approach for modeling the spatial correlation between pelvic and spinal key points. Our system development involved the inclusion of 337 healthy adults, aged 18 to 35 years, with full-length, free-standing lateral radiographs from our institute. Two orthopedic spine surgeons reached a consensus and annotated the X-ray images by manually recording the spatial coordinates of the following pelvic and spinal key points, as illustrated in Figure 1:
12 sacral key points, including posterior edge of the S1 endplate, sacral promontory, posterior edge of the S2-5 superior endplates, anterior edge of the S2-5 superior endplates, posterior edge of the Co1 superior endplate, anterior edge of the Co1 superior endplate, and center of the Co1-Co2 intervertebral junction.
11 key points for the Ilium and Ischium, including midpoint of the line connecting the posterior superior iliac spine and posterior inferior iliac spine, apex of the greater sciatic notch, ischial spine, posterior superior edge of the ischial tuberosity, anterior inferior edge of the ischial tuberosity, pubic symphysis, anterior inferior iliac spine, anterior superior iliac spine, line connecting the anterior superior iliac spine and iliac crest apex, and iliac crest apex.
2 femoral head centers.
59 vertebral body key points for the lumbar, thoracic, and cervical spine.
The collection of sacrum, femoral head and hip key points is also referred to as “the shape of a pelvis” in the remaining of this paper. The collection of the 59 vertebral body key points is also referred to as “the shape of a spine” in the remaining of this paper. We random selected 63 cases from the 337 cases as a validation dataset for hyperparameter tuning and model selection. The remaining 274 cases were used to train a deep learning model based on the Seq2Seq architecture. We further collected another dataset from multicenter institutes, containing 51 well-positioned full length standing lateral x-ray images, to form a test dataset to evaluate the performance of the proposed system against other established approaches.
Spatial correlation between pelvic and spinal key points were formulated as a supervised regression problem. The independent variable was the shape of a pelvis depicted by a collection of pelvic key point coordinates while the dependent variable was the shape of a spine depicted by a collection of spinal key point coordinates and endplate orientation vectors. A Seq2Seq model is a special type of deep learning models that leverages its recurrent mechanism to model complex long-term spatial or temporal data. Generally, a Seq2Seq model is composed of an encoder and a decoder module. The encoder models the pelvic shape in our case, to capture its intra-class correlation among pelvic key points. The resulting correlation information (or memory) is then fed into the decoder module to model correlation between pelvic shape and spinal shape. In our training setup, the outputs of the decoder module were the regressed offset vectors, forming a zig-zag path, which facilitates learning compared to other formulations Figure 2. The Seq2Seq model was optimized by the Adam optimizer using the combination of a mean-square loss term for coordinate regression and a cosine similarity loss term for orientation regression. A minibatch of 16 samples sufficed in our case to converge after 1,000 epochs.
Rotation is another issue when it comes to exploring pelvic-spinal correspondence. The physiological rotation of pelvic makes it hard to find a “neutral position”. To overcome this issue, we propose a novel Angle Correlation Network (ACN) that rotates input shapes by a predicted angle that would minimize the overall training loss. Throughout the training, ACN learns to address the neutral position problem by itself, as shown in Figure 3.
During development of the system, we find it critical to conduct pre-processing steps to stabilize training and increase generalization ability. Specifically, a Procrustes analysis was performed on each pair of the pelvic and spinal shape. The aim was to obtain a similar scale, orientation and placement by minimizing shape difference between the training samples so that the model could focus on the given task. The Procrustes analysis contained three steps: translation, uniform scaling, and rotation. The first two steps were straight-forward to perform. The rotation step was conducted by comparing to a randomly selected reference shape from the training dataset. The inverse steps were performed accordingly on the model predictions in the post-processing steps in order to recover the scale and orientation of the input. The overview of the proposed system was shown in Figure 1.
2. Performance of the AI-based system compared to a classic clinical study
To evaluate the performance of the AI-based system in predicting sacral slope (SS), pelvic tilt (PT) and lumbar lordosis (LL), a comparison was made with a classic clinical study. The methodology for this comparison is outlined below.
Dataset preparation and model selection
The dataset used in this study was divided into three subsets: training, validation, and testing. The training dataset was used to train various AI models, while the validation dataset was employed to fine-tune the models and select the best-performing one based on pre-defined evaluation metrics. The held-out dataset, which contained 51 cases, was reserved for testing the performance of the selected model. A Seq2Seq model was chosen as the AI-based system for this study due to its ability to learn complex patterns and relationships within the data. The best-performing Seq2Seq model was identified according to its performance on the validation dataset.
Evaluation metrics
To assess the performance of the selected Seq2Seq model, two evaluation metrics were employed: 1) Mean Absolute Error (MAE): This metric calculates the average magnitude of the errors between the predicted values and the true values of LL and TK. A lower MAE indicates better model performance. 2) Correlation Coefficient (R value): This metric measures the strength and direction of the linear relationship between the predicted values and the true values of SS, PT and LL. A higher R value indicates a strong correlation between the predictions and the true values, suggesting better model performance.
Classic clinical study
A classic clinical study (SS=0.63PI+7, PT=0.37PI-7 [3], LL=0.62PI+17.6 [7] was chosen for comparison to evaluate the effectiveness of the AI-based system. In this study, researchers used regression analysis and other statistical methods to predict SS, PT and LL values based on available clinical and radiographic data. The performance of the classic clinical study was assessed using the same evaluation metrics (MAE and R value) as those employed for the AI-based system.
Comparison and analysis
We randomly rotate the pelvic images of the 51 test subjects to be tested, with rotation angles ranging from -60 to 60 degrees. After the rotation, we further process the data for comparison. The performance of the AI-based system and the classic clinical study was compared in terms of the evaluation metrics (MAE and R value). The results were then analysed to determine the effectiveness of the AI-based system in predicting SS, PT and LL values relative to the classic clinical study. To ensure a rigorous comparison, potential confounding factors, such as differences in patient populations or data quality, were taken into account. In addition, the generalizability of the AI-based system to diverse patient populations was considered, particularly if the training data was not representative of the target patient group. By comparing the performance of the AI-based system and the classic clinical study using the specified methodology, the study aimed to demonstrate the potential advantages and limitations of using an AI-based approach for predicting SS, PT and LL values in patients.
Result
The results of our study are presented in Table 1, Table 2, Figure 4, and Figure 5. Table 1 presents the details of the 51 testing cases, and Table 2 presents the statistical description, Mean Absolute Error (MAE) and Pearson correlation coefficient for PI and AI predictions on SS, PT, and LL variables.
For the SS, PT and LL variable, the Pearson correlation coefficient for AI predictions was higher than that for PI predictions (0.80 vs 0.67, 0.73 vs 0.52, 0.76 vs 0.48), demonstrating a stronger linear relationship between AI predictions and the true values. In addition, AI predictions showed a lower MAE compared to PI predictions on LL (5.52 vs 6.69).
From Figures 4 and 5, we can see that the PI predictions for SS, PT, and LL perform reasonably well in the median range but are less satisfactory at the two ends of the normal distribution. In comparison, AI not only performs well in the median range but also demonstrates impressive results at the extreme values.
The AI prediction method consistently outperforms the PI prediction method.
Discussion
In the 1990s, Legaye and Dubousset introduced sagittal pelvic parameters (PI, PT, and SS) to the field of spinal research [2]. Lots of scholars used these pelvic parameters, along with lumbar lordosis (LL) and other factors, to analyze and classify the sagittal sequence of normal individuals in an upright posture [2-17]. Their work revealed the relationship between these parameters and numerous spinal disorders.
More importantly, many studies have confirmed that having a good match between the pelvic parameters and the lumbar sequence is crucial for diagnosing and treating spinal disorders, as well as for avoiding adverse postoperative events. As a result, the evaluation systems for pelvic parameters (PI, SS and PT) and LL have become a research hotspot in the spinal field.
Vialle, R et al recruited 300 healthy volunteers and abstained SS=0.63PI+7 and PT=0.37PI-7 [3]. Schwab et al. proposed that LL= PI±10° can be used as a rough approximation of the relationship between pelvic parameters and lumbar lordosis in an upright posture [10]. Mac-Thiong and Roussouly, on the other hand, derived a more accurate formula based on a larger sample size: LL=0.62PI+17.6 [7].
Researchers from various countries have also developed formulas for predicting LL using PI as the basis, based on their own experimental samples. In the study, due to the test sample size (51 cases), R-value was not as high as in previous literature, but the result showed that AI was better than PI in predicting SS, PT and LL.
To date, all mainstream spinal-pelvic sagittal studies both domestically and internationally have been based solely on the PI system [2-17]. However, PI is just a simple angular measurement of pelvic morphology. Although it largely reflects the corresponding pelvic morphology, a single angle cannot accurately represent the many specific features of an individual’s pelvic morphology. Different features of pelvic morphology may have varying impacts on spinal matching. Therefore, relying solely on the PI system to predict LL using pelvic parameters may cause us to overlook other useful pelvic morphological parameters.
We cannot deny that in the era of human computational power, PI has been the best and most straightforward pelvic morphological parameter for predicting LL. However, we have now entered the era of AI computational power, where AI’s superior computing capabilities eliminate the need to consider the significant costs of human labor and time [20]. Therefore, through AI-based analysis and prediction of the pelvis and spine, we can avoid losing any data and not waste any potential meaningful opportunities [19]. We developed an AI-based system using a Seq2Seq algorithm to model the spatial correlation between pelvic and spinal keypoints [18]. The system was trained on 274 cases and evaluated on a test dataset of 65 cases. A novel Angle Correlation Network was proposed to address the issue of rotation. The system outperformed a classic clinical study in terms of both mean absolute error and correlation coefficient, using MAE and R value as evaluation metric. In this research, we manually annotated key points of the pelvis and spine on standing lateral full-length radiographs. The relationships between these key points generate much more data than the simple relationship between PI and LL. In fact, not only can we obtain the LL value of the spinal sequence, but we can also obtain more detailed local sequence relationships, such as the lower lumbar lordosis angle of L4-S1, and so on. All this data can be presented in a visualized form, which is something the PI system cannot achieve.
There are still some shortcomings in this study, as we manually annotated key points of the pelvis and predicted their relationships. Further research is needed to: 1) improve AI-based automatic recognition of key points and prediction, 2) abandon key point recognition and use purely morphological learning for prediction, which relies on a more extensive data sample, and 3) improve three-dimensional morphological recognition and prediction using technology like CT three-dimensional reconstruction, EOS, and others. We firmly believe that this is just the beginning of AI-based research on the sagittal sequence of the spine and pelvis, and it will ultimately far surpass human capabilities.
Conclusion
In conclusion, this study demonstrates the potential of deep learning-based methods in predicting spinal sagittal alignments. Our Seq2Seq-based AI model outperformed the traditional method based on PI. By providing accurate and detailed predictions, the AI model has a potential to improve surgical planning and outcomes for patients with spinal diseases.
Data Availability
All data produced in the present study are available upon reasonable request to the authors