Introduction
Total knee arthroplasty (TKA) is the gold standard surgical treatment for knee osteoarthritis, which has been shown to reduce knee pain and improve function for most patients (Steinhaus, Christ, and Cross 2017). The number of TKAs performed in the United States is expected to increase dramatically over the next few decades, with conservative projections predicting an increase of 143% by 2050 (Inacio et al. 2017). In efforts to improve function and reduce knee pain following TKA, modifications and innovations in TKA design have been introduced.
The development of highly cross-linked polyethylene is one of the best examples of implant design improving revision rate, thereby saving money for the healthcare system (Lombardi, Berend, and Adams 2014). Prior to its introduction, polyethylene wear was a leading cause of TKA revision (Lombardi, Berend, and Adams 2014; Tarazi et al. 2021). Recent studies have found that revisions indicated for polyethylene wear are now less common than aseptic loosening, instability, and infection (Lombardi, Berend, and Adams 2014; Tarazi et al. 2021). Similarly, other TKA innovations, such as cementless TKA fixation and asymmetric pivoting TKA designs, report superior patient outcomes while integrating today’s technological advances into implant design (Mont et al. 2017; Harris et al. 2019; Batra et al. 2021). To ensure good value for the patient and society, it is important to demonstrate that the additional costs for developing new implants are associated with improved outcomes.
Obviously, a variety of outcome measures could be evaluated; however, the two most critical are patient satisfaction and implant survivorship. Each of these requires a different approach, with satisfaction measured by patient-reported outcomes measures (PROMs) and survivorship evaluated by revision rate. This comparison, which must include multiple contemporary and legacy implants, is virtually impossible to perform at a single institution. Since the incidence of revision is low (Vasarhelyi and Petis 2020), utilization of regional and national registry data is important when attempting to aggregate data on specific implant designs. This approach is not unique, as shown by Hoskins et al who utilized revision rates reported by the Australian Orthopaedic Association National Joint Replacement Registry (AOANJRR) database to determine “best-performing” prosthesis combinations in total joint arthroplasty (TJA) compared to prosthesis combinations with higher-than-anticipated revision rates (Hoskins et al. 2022). Best-performing TKA prostheses in their study were defined by prosthesis combinations with a 5-year cumulative percent revision rate of ≤ 2.3% and at least 500 procedures (Hoskins et al. 2022; Steiger et al. 2013). Recently, Kelly et al also conducted a registry review to compare revision risk between one manufacturer’s TKA implant system between generations (Kelly et al. 2024). Revision rates can provide a useful metric in identifying problematic implants and improving patient outcomes, as previously seen in older hip arthroplasty systems (De Steiger et al. 2011; Langton et al. 2010).
There remains a paucity of literature that uses aggregated implant design data across registries to compare their performance between generations. The purpose of this registry review is to evaluate whether there is a decrease in the revision rates for new TKA implant designs compared to older models from the same manufacturer. We hypothesize that improved implant designs would reduce all-cause TKA revision, which would justify the additional healthcare expense to develop these new implants.
Methods
A critical review of the literature was performed to find registries that report TJA implant data and revision rates. Institutional Review Board (IRB) exemption was granted. Registries that reported revision rates by year were reviewed and selected for study. Only registries with an annual report printed in the English language were reviewed. Data were recorded when available, including revision indication, total number of cases, revision percentage, and bearing type (fixed or mobile). Implant data collected included the femoral component model and the tibial component model. Registries that did not offer specific implant information were excluded from the study. Registries not providing at least five years of follow-up data were also excluded. We elected to include TKA implants only from the four largest orthopaedic implant manufacturers to have adequate power to make comparisons. These implants were divided into categories depending on their introduction into the marketplace. Based on their introduction to the market, systems from the same manufacturer are compared to their immediate successor and defined as a predecessor implant or a successor implant. Only contemporary implants were included, defined as having at least five years of approval and having been approved less than 30 years ago. Information regarding implant approval was gathered from the United States Food and Drug Administration (FDA) Premarket Approval (PMA) database and the United States FDA 510(K) Premarket Notification Database.
The study’s primary outcome was to compare revision rates between predecessor and successor generations of TKA implant designs. We secondarily sought to compare differences in revision rates and revision indications between registries.
Statistical Analysis
T-tests or ANOVA were used to compare continuous data. Chi-square testing was used for categorical data. Statistical significance was set at an alpha level of 0.05, and all statistics were performed using R Studio (Version 3.6.3, Vienna, Austria).
Results
There were four registries that reported specific implant design revision rates and met all the criteria that were identified. These included the AAOS American Joint Replacement Registry (AJRR), the Michigan Arthroplasty Registry Collaborative Quality Initiative (MARCQI), the National Joint Registry (NJR), and the AOANJRR. These registries provided published annual reports containing revision rates broken down by implant type. These registries combined to provide data on 3,168,279 primary and 1,554,953 revision TKAs. Follow-up data ranged from 5 to 20 years, depending upon the registry, and all four registries provided the indications for most revisions (Table 1 and Table 5).
There were nine different implants from four different manufacturers selected for review, all of which were FDA-approved between August 22nd, 1995, and November 14th, 2014. Of these implants, the NexGen and Persona systems (Zimmer Biomet), the Triathlon system (Stryker), the Sigma and Attune systems (Depuy Synthes), and the Genesis II and Journey II systems (Smith and Nephew) all utilize both femoral and tibial components from the same system. Implant combinations utilizing different femoral and tibial systems included Stryker’s Scorpio femoral component and ScorpioNRG tibial component, as well as Smith and Nephew’s Legion femoral component and Genesis II tibial component.
The flow chart in Figure 1 demonstrates the predecessor implant as it correlates to the successor implants from each manufacturer. It also displays each knee system’s 510(K) premarket submissions and FDA approval dates. The Journey II (Femur/Tibia) from Smith and Nephew was treated separately as the Smith and Nephew manufacturer already had entries for the predecessor and successor (Figure 1).
We collected 66 revision rate data points from each registry at follow-up intervals between 1 and 20 years. Data collected were similar for each registry as 19.7% of entries came from the MARCQI (n=13), 22.7% of entries from the AJRR (n=15), 25.8% from the NJR (n=17), and 31.8% from the AOANJRR (n=21). Data points represented the revision rate of a single implant from a single registry at a given time point. For instance, Zimmer/Biomet had 15 data points (22.7%) across the four registries on their NexGen and Persona TKA systems, as each registry had separate data points on the different generations and bearing surface types for each TKA system. Similarly, Stryker had 14 implant data points (21.2%) on their Triathlon and Scorpio/ScorprioNRG systems; Depuy Synthes had 18 implant data points (27.3%) on their Sigma and Attune systems; and Smith and Nephew had 19 implant data points (28.8%) on their Genesis II, Journey II, and Legion/Genesis II systems. Predecessor implants represented 43.9% (n=29) of the total data, while successors represented 50.0% (n=33). The Journey II made up 6.06% (n=4) of the entries.
A comparison of cumulative revisions of each implant shows similar yearly revision rates for each manufacturer and implant (Table 2). The Stryker Triathlon and Depuy Synthes Attune successor TKA systems showed higher revision rates than their respective predecessors at each recorded interval from the primary TKA. “Modern” Smith and Nephew TKA systems (i.e., Legion/Genesis II and Journey II) also reported higher revision rates at the specified yearly intervals compared to the predecessor Genesis II system. The revision rates for each implant trended upwards as the years of follow-up increased, regardless of implant generation.
When comparing the predecessor, successor, and Journey II implants, we found no significant differences between the groups at 1, 2, 3, 5, 7, 10, or 15 years (Table 3). Like implant-specific revision rates over time, there was an increase in revision rates with longer follow-up between generations. There was a significant difference at 4 years, with the predecessor implants averaging a revision rate of 2.6% and successor implants averaging a revision rate of 2.7%, while the Journey II averaged a revision rate of 4.6% (P=0.009) (Table 3).
Cumulative revision rates between each registry demonstrated significant differences at specific timeframes. Differences in revision rates at each year of follow-up were statistically significant at 1, 3, 5, and 10 years (Table 4).
Indications for revision were collected from each registry and categorized as shown in Table 5. Mechanical indications included metallosis, sizing issues, malalignment, and implant failure. Wear included all instances of tibial component, femoral component, patellar component, and polyethylene spacer wear. All categories comparing the four registries to each other reported significant differences.
Discussion
This registry review sought to evaluate differences in revision rates between TKA implant designs from newer and older generations. Contrary to our hypothesis, we found that modern implant designs failed to provide a significantly measurable difference in revision rate compared to predecessor implants individually and as a group.
Previous studies have questioned the effectiveness of newer-generation implant designs, but there has been variability in outcomes. Existing literature comparing the Attune and Sigma implants reported no differences in clinical outcomes at 2-year and 5-year follow-ups (Chua et al. 2019; Hauer et al. 2021; White et al. 2020), with one study by Behrend et al demonstrating similar joint awareness scores between the two generations (Behrend et al. 2019). One analysis of a United States-based TKA registry found higher risks for aseptic revision and instability with a modern implant design compared to its predecessor from the same manufacturer (Kelly et al. 2024). Although one study found statistically significant differences between newer and older TKA generations in PROMs at the 6-month postoperative period, they did not address whether the difference met minimally clinically important differences (Toossi et al. 2023). Conversely, a meta-analysis by Choudhury et al reported improved Knee Society scores and reduced patellofemoral complications in the modern Attune than the predecessor Sigma design (Choudhury et al. 2023).
Although newer generation TKA designs did not necessarily yield significantly lower revision rates, the expanded population that TKAs are performed on today may suggest improved outcomes in modern implant designs compared to their predecessor counterparts (S. M. Kurtz et al. 2009). The mean age of patients undergoing primary TKA is decreasing, with younger, more active patients requiring the procedure (S. M. Kurtz et al. 2009; S. Kurtz et al. 2007). The shift in demographic parallels the newer uncemented fixation techniques that have developed, including improved biologic ingrowth and osseointegration at the bone-implant interface (Helvie, Deckard, and Meneghini 2023). With the expansion of modern uncemented TKA designs that provide better fixation in younger patients – and thus better durability – having these implants perform similarly to predecessor counterparts with no change in revision risk may suggest inherent improvements. Controlling for patient selection through propensity score matching or regression analysis more accurately isolates implant design affecting outcomes. Although this is outside the scope of our registry review, stratification by specific patient demographic variables would allow for a more granular analysis of outcomes between successor and predecessor implant generations.
Implant prices constitute a substantial proportion of the cost associated with a TKA procedure. Nguyen et al conducted a systematic review and reported that approximately 22.3% of the total cost of a primary TKA procedure is attributed to the cost of the implant (Nguyen et al. 2024). A previous comparison of a “premium” TKA implant to a standard TKA implant found that the cost of the “premium” implant (i.e., components using mobile-bearing designs, high-flexion designs, oxidized-zirconium femoral components, newer moderately-crosslinked polyethylene inserts, or a combination thereof) averaged $1,000 higher with no discernable difference in revision rates (Gioe et al. 2011). With our findings being consistent with the existing literature comparing outcomes between generations, this can raise the question of whether a newer, more expensive implant design is justified if they do not demonstrate clear advantages (Chua et al. 2019; Gioe et al. 2011). However, design alone should not be weighted as the sole factor for a new implant’s success (Hauer et al. 2021). If implant design also coincides with improved perioperative management, such as by reducing surgical tray instrumentation and improving alignment with patient-specific instrumentation or computer-assisted navigation, these can contribute to justifying the higher upfront costs of an implant if operating efficiency and other episode-of-care costs are ultimately optimized (Law 2021; Sassoon et al. 2015).
The secondary aim of our registry review was to compare differences in TKA revision rates and revision indications by registry. The significant increase in revision rates with longer follow-up in all the registries certainly lends credence to the data in the present study. However, the statistical differences between the registries for overall revision rates and the reason for revision remain to be explained. The MARQI, limited to the single state of Michigan, covered the smallest population and was found to have the highest reported cumulative revision rates. This may be due to the completeness of the data, as 96% of all TJA procedures and revisions are captured by the registry (Hallstrom, Hughes, and Huddleston 2022). For comparison, the AJRR is reported to capture just 40% of TJA procedures in the United States (Hallstrom, Hughes, and Huddleston 2022). However, registries that prioritize completeness often need to sacrifice the level of detail, as seen with the AOANJRR (Porter, Rolfson, and de Steiger 2022). Regional differences in indications, training, and support systems may explain some differences between registries. One investigation of the AOANJRR data found that surgeons with low revision rates following TJA had different indication distributions compared with surgeons with a higher rate of revisions (Hoskins et al. 2022). They found that at a low revision rate, surgeons had fewer revisions for aseptic loosening, followed by instability (Hoskins et al. 2022). Therefore, it is possible that our meta-analysis comparing revision indications between registries (Table 5) is potentially skewed by data collection techniques and surgeon differences.
Revision rate is a parameter previously used in literature to guide surgeon decision-making when selecting implant designs that will optimize patient outcomes. In the context of total shoulder arthroplasty (TSA), Page et al found there were higher revision rates in anatomic TSAs with non-crosslinked, all-polyethylene glenoid components when compared to crosslinked, all-polyethylene glenoid components (Page et al. 2022). Similarly, the lack of reduction in cumulative revision rates between premium and standard TKA and THA implants reported by Gioe et al would suggest that implants with newer technologies do not necessarily translate to lower revision rates (Gioe et al. 2011). The natural extension of a registry analysis assessing potential differences in revision rates of implants would be to collect PROMs that could provide insight into patient satisfaction. It is a rightful assumption that higher revision rates of an implant design would inversely correlate with PROM metrics. However, a recent study by Hoskins et al found there was practically no correlation between revision rate and PROMs in THA and TKA procedures (Hoskins et al. 2024). This would suggest that either revision rate or PROMs alone would be incomplete, imperfect indicators of an arthroplasty procedure and may well, in fact, track different aspects of an implant’s performance (Hoskins et al. 2024). Furthermore, previous prospective trials comparing newer generation TKA implant designs to their predecessors using PROM scores and patient satisfaction assessments found no significant differences between generations (Chua et al. 2019; Hauer et al. 2021). Besides the studies calling into question the cost-effectiveness of newer implant designs if they yield comparable clinical outcomes to older generations, the lack of sensitivity in the measured functional outcome scores is also cited as a potential limitation (Hoskins et al. 2022; Hauer et al. 2021).
Despite the non-superior outcomes in revision rate seen in our study between newer and older generation TKA systems, this should not be taken as a suggestion to pause innovation. Highly cross-linked polyethylene remains a noteworthy advancement in TKA design that significantly reduced the number of revision TKAs being performed for polyethylene wear (Lombardi, Berend, and Adams 2014). Advancements in uncemented fixation have led to an increase in the number of younger patients with higher activity demands undergoing TKA (Mont et al. 2017; Harwin et al. 2015). Expanded implant sizing and alternative pivoting designs also help to further improve knee kinematics and accommodate morphological variations in bone (Harris et al. 2019; Batra et al. 2021; Dai et al. 2014). These innovations provide demonstrable benefits that span more than just the revision rate, as these modern techniques can optimize surgical efficiency or improve implant compatibility with the patient. Furthermore, if a newer-generation implant design is optimized to facilitate faster recovery times, earlier mobilization, and a less invasive surgical approach, this translates into tangible value for the healthcare system and patient. The study’s findings do not intend to downplay these advancements but rather highlight the need for evidence outside of revision rates or PROMs demonstrating that changes in implant design lead to their improved performance.
Our findings should be interpreted within the context of the study design. As with all extensive database studies, there are differences in data collection methodologies and follow-up intervals that introduce bias. The large number of patients evaluated may have mitigated some of these discrepancies. The release dates of each implant did not necessarily overlap those from other manufacturers. By comparing each predecessor to their successor from the same manufacturer, we attempted to negate this effect. The scope of the registry analysis also could not allow for a more granular analysis of patient selection by matching certain demographics of interest, which could better isolate how differences in implant generations can affect outcomes. Another limitation is that registry data restricted our analysis to revision rates and could not comment on the comparisons in PROMs, which are not always reported in registry databases (Vasarhelyi and Petis 2020). Recommendations for further investigation include developing prospective, randomized trials that assessed PROMs between modern TKA systems and their predecessors, as well as evaluating surgical efficiency. This would be interpreted not independently of revision rate but rather as an adjunct to it, allowing orthopaedic surgeons to obtain a better understanding of an implant design’s performance in relation to their older counterparts.
Conclusion
In the present era of cost consciousness, it is essential to encourage the development of new implants that are more durable and are associated with improved patient results. In this review of registry data, we found that modern TKA implants do not appear to significantly decrease revision rates compared to predecessor implants. Future studies that include PROMs and surgical efficiency are the logical next step to assess the contribution of new technology to TKA outcomes.