The SMART-OR Framework for Implementing Artificial Intelligence in the Operating Room

Hillary Lia

PhD

1,2 Emailhillary.lia@mail.utoronto.ca

Assistant Professor of Research

Divya Kewalramani

3,8✉

Muhammad Uzair Khalid

MD, MSc

1,4 Phone312-478-9817 Phone@divvya20k Emaildk1362@rwjms.rutgers.edu

Justin Benton

BSN

3 Phone0009-0004-9125-0909 Emailuzarkh@student.ubc.ca

Caterina Masino

1 Phone0009-0006-3006-0532 Emailjb2066@rwjms.rutgers.edu Emailcaterina.masino@uhn.ca

Rachel L. Choron

Tyler J. Loftus

MD, PhD

5 Phone0009-0001-7256-1750 Emailrc1147@rwjms.rutgers.edu Emailtyler.loftus@surgery.ufl.edu

Mayur Narayan

MD, MPH, MBA, MHPE

3 Emailmayur.narayan@rutgers.edu

Wagner H. Souza

PT, PhD

1,6 Emailwagner.souza@uhn.ca

Amin Madani

MD, PhD

1,7 Emailamin.madani@uhn.ca

1 Surgical Artificial Intelligence Research Academy University Health Network Toronto ON Canada

2 Temerty Faculty of Medicine University of Toronto Toronto ON Canada

3 Division of Acute Care Surgery, Department of Surgery Rutgers Robert Wood Johnson Medical School New Brunswick NJ United States

4 Department of Surgery University of British Columbia Vancouver BC Canada

5 Department of Surgery University of Florida Health Gainesville FL United States

6 Division of General Surgery University Health Network Toronto ON Canada

7 Department of Surgery University of Toronto Toronto ON Canada

Division of Acute Care Surgery Rutgers Robert Wood Johnson Medical School 125 Paterson Street, Suite 6300 08901 New Brunswick NJ

Hillary Lia, PhD^1,2; Divya Kewalramani, MD³; Muhammad Uzair Khalid, MD, MSc^1,4; Justin Benton, BSN³; Caterina Masino, MA¹; Rachel L. Choron, MD³; Tyler J. Loftus, MD, PhD⁵; Mayur Narayan, MD, MPH, MBA, MHPE³; Wagner H. Souza, PT, PhD^1,6; Amin Madani, MD, PhD^1,7

Author Affiliations:

¹ Surgical Artificial Intelligence Research Academy, University Health Network, Toronto, ON, Canada

² Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada

³ Division of Acute Care Surgery, Department of Surgery, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, United States

⁴ Department of Surgery, University of British Columbia, Vancouver, BC, Canada

⁵ Department of Surgery, University of Florida Health, Gainesville, FL, United States

⁶ Division of General Surgery, University Health Network, Toronto, ON, Canada

⁷ Department of Surgery, University of Toronto, Toronto, ON, Canada

Corresponding Author: Divya Kewalramani, MD

Assistant Professor of Research, Division of Acute Care Surgery

Rutgers Robert Wood Johnson Medical School

125 Paterson Street, Suite 6300

New Brunswick, NJ 08901

Phone: 312-478-9817

Email: dk1362@rwjms.rutgers.edu

X: @divvya20k

ORCID: 0009-0004-9125-0909

Contacts:

HL: hillary.lia@mail.utoronto.ca 0000-0003-4775-0987

DK: dk1362@rwjms.rutgers.edu 0009-0004-9125-0909

MK: uzarkh@student.ubc.ca 0000-0001-6269-9839

JB: jb2066@rwjms.rutgers.edu 0009-0006-3006-0532

CM: caterina.masino@uhn.ca 0009-0001-7256-1750

RC: rc1147@rwjms.rutgers.edu 0000-0002-2297-9956

WS: wagner.souza@uhn.ca 0000-0003-3982-5475

TL: tyler.loftus@surgery.ufl.edu 0000-0001-5354-443X

MN: mayur.narayan@rutgers.edu 0000-0002-7232-7446

AM: amin.madani@uhn.ca 0000-0003-0901-9851

Words

2989 words

References

33 references

Abstract

Background

Intraoperative artificial intelligence (AI) decision support systems hold promise for improving surgical outcomes, yet significant barriers impede their translation from development to clinical deployment. A systematic implementation framework informed by stakeholder perspectives is needed to guide responsible adoption in operating room (OR) settings.

Methods

We conducted a qualitative study using the Consolidated Framework for Implementation Research at a single North American academic institution. Phase I involved semi-structured interviews with OR personnel (surgeons, trainees, nurses, biomedical engineers) recruited through purposive maximum variation and snowball sampling until thematic saturation. Phase II comprised focus groups with patients recruited via convenience sampling. Interview and focus group transcripts underwent iterative thematic analysis using both deductive and inductive coding approaches.

Results

Twenty-two stakeholder interviews and two patient focus groups (n = 8) identified unique barriers and facilitators that coalesced into five major themes defining implementation requirements for intraoperative AI decision support: intuitive design, adequate training, maximizing adaptability, ongoing support, and fostering buy-in. These themes were contextualized across the surgical timeline: pre-implementation, implementation, and post-implementation phases, to create a comprehensive SMART-OR framework. Key barriers included overreliance concerns, automation bias risks, workflow disruption, team coordination challenges, and medico-legal ambiguity. Facilitators included perceived accuracy improvements, real-time guidance utility, and enhanced educational opportunities.

Conclusions

This study provides the first comprehensive implementation framework for intraoperative AI decision support, offering practical guidance across the technology lifecycle. The framework addresses critical gaps between AI development and clinical deployment by integrating diverse stakeholder perspectives into actionable recommendations. Future implementation efforts should prioritize transparent validation, coordinated training, and clear governance structures to ensure responsible adoption.

Keywords

Artificial intelligence

intraoperative decision support

operating room

implementation science

human factors

adoption

SMART-OR Implementation framework

Introduction

Intraoperative adverse events cause preventable harm to surgical patients worldwide [1]. Recent advances in artificial intelligence (AI) have enabled the development of real-time decision support systems that recognize instruments, delineate anatomy, classify surgical phases, and identify unsafe zones of dissection during ongoing procedures [2–4]. These technologies aim to augment surgeons' perception at critical moments, with particular promise for resource-limited settings where specialist support is unavailable [1, 5]. Despite impressive technical performance, translation into routine practice has lagged substantially. For most intraoperative AI tools, rigorous implementation studies remain sparse, and few have progressed beyond pilot testing to become evidence-based standards of care [6]. This disconnect between algorithm development and clinical deployment represents a critical barrier to realizing the patient safety benefits these technologies promise.

Successful deployment requires alignment with operating room (OR) workflows, human-factors-informed interface design, role-specific training, mechanisms for performance monitoring, and generalizability across sites with varying equipment and staffing [7–10]. While established implementation science frameworks provide valuable theoretical guidance [11–13], they operate at a level of abstraction requiring substantial interpretation for the unique demands of the OR: real-time decision-making under pressure, strict sterility requirements, complex multi-professional teams, and zero tolerance for failures that could compromise patient safety. Context-specific translation of these frameworks into actionable guidance for intraoperative AI remains an unmet need. The GoNoGoNet algorithm, a validated computer vision system for identifying safe versus unsafe zones of dissection during laparoscopic cholecystectomy, exemplifies this challenge [14]. Despite demonstrated potential to reduce bile duct injuries, GoNoGoNet’s clinical impact remains unrealized without effective implementation strategies [15].

To address the critical gap between intraoperative AI development and clinical deployment, this study aims to qualitatively identify barriers, facilitators, and implementation requirements for intraoperative AI decision support. By engaging diverse stakeholders, we developed the first implementation framework and actionable checklist for deploying AI in the OR, offering concrete guidance for future implementation efforts.

Methods

Study Design and Theoretical Framework

A two-phased qualitative study using thematic analysis methodology was conducted. Thematic analysis was adopted to allow for flexible data analysis, combining inductive and deductive approaches for integrating existing literature with results emerging from the study [16]. The study design was informed by Consolidated Framework for Implementation Research (CFIR), which provided a structured lens for exploring implementation determinants across five domains: intervention characteristics, outer setting, inner setting, characteristics of individuals, and implementation process.¹¹ The Replicating Effective Programs (REP) framework additionally guided our focus on practical, phase-specific implementation steps [12].

The University of Health Network Research Ethics Board approval was obtained for this study and all participants provided written informed consent (#23–517). Data collection occurred between 11/29/2023 and 08/28/2024. To ground discussions in clinical reality, we used GoNoGoNet, a validated deep learning algorithm [17, 18], as a specific use case throughout all interviews and focus groups. Participants viewed video demonstrations and received lay-language explanations of the algorithm's functionality, enabling specific feedback rather than abstract speculation.

Participant Recruitment and Data Collection

Phase I: We employed purposive maximum variation sampling to recruit OR personnel (surgeons, trainees, nurses, biomedical engineers) at one academic institution, with diverse experience levels, and attitudes toward AI adoption, supplemented by snowball sampling. Thematic saturation was operationalized as no new themes across three consecutive interviews [19]. Semi-structured interviews were conducted via Microsoft Teams (Microsoft Corporation, Redmond, WA) using an interview guide (Supplementary Material, Section 1), structured around CFIR domains.

The interviewer followed the participant lead to explore emerging topics and perspectives. Interviews were audio-recorded, de-identified, transcribed verbatim, verified for accuracy.

Phase II: Patients were recruited through convenience sampling from a research volunteer database, regardless of prior surgical experience. Two focus groups with four participants each were conducted using semi-structured discussion guides (Supplementary Material, Section 2).

Each 60-minute session began with standardized education about intraoperative AI, followed by moderated discussion of patient perspectives on benefits, risks, disclosure preferences, and trust considerations. Focus groups were audio recorded, de-identified, transcribed, and verified.

Data Analysis

Two researchers independently coded all transcripts using NVivo software (Lumivero, Denver, CO). An initial deductive coding framework based on REP implementation phases was applied to all transcripts, then expanded inductively as new concepts emerged. Discrepancies between coders were resolved through consensus between the full research team. Rigor was maintained through negative case analysis and audit trail maintenance [20].

Framework Development

Identified themes were synthesized into a three-phase Systematic Methodology for AI Readiness and Translation in the Operating Room (SMART-OR) implementation framework with actionable checklists. The preliminary framework underwent member-checking, who confirmed relevance and suggested refinements that were incorporated to produce the final stakeholder-informed framework.

Results

Phase I enrolled 20 OR personnel (91% recruitment rate) including 10 surgeons, 6 nurses, 2 surgical trainees, and 2 biomedical engineers to the study. Participants represented practice settings across Canada and the United States (Table 1). Phase II enrolled six patient participants (recruitment rate: 6%), comprising three males and three females who participated in two focus groups. In Phase I, each interview lasted a mean of 29 minutes (range, 18–45 minutes). Five major themes emerged from the analysis that defined requirements for implementing intraoperative AI decision support: intuitive design, adequate training, maximizing adaptability, ongoing support, and fostering buy-in. These themes span pre-implementation, implementation, and post-implementation phases and address technical, educational, organizational, and cultural dimensions of technology adoption (Fig. 1).

Table 1

Demographic and Practice Characteristics of Operating Room Personnel (Phase 1)
Characteristics	Value
Attending Surgeons, n
Female, n (%)	5 (50)
Male, n (%)	5 (50)
Surgical Trainees, n
Female, n (%)	1 (50)
Male, n (%)	1 (50)
Postgraduate level, range	PGY3-PGY4
Operating Room Nurses, n
Female, n (%)	4 (66)
Male, n (%)	2 (33)
Biomedical Engineers, n
Female, n (%)	2 (100)
Male, n (%)	0
Practice Setting
Academic medical center, n (%)	14 (70)
Community hospital, n (%)	6 (30)

Fig. 1

Thematic Framework for Implementing Intraoperative Artificial Intelligence Decision Support Systems

Perceived Benefits of Intraoperative AI Decision Support

Participants across all stakeholder groups identified three anticipated benefits. First, surgeons described the technology as enhancing patient safety by serving as a cognitive checkpoint during routine cases: “the surgeon could be pretty much on autopilot for a straight-forward case and then something like this causes a warning to pop up and grab your attention somehow. It’s like a slow-down moment.” This function was viewed as particularly valuable for surgeons practicing in resource-limited settings without immediate access to expert consultation.

Second, surgeons and trainees identified educational applications. The visual overlay was described as facilitating intraoperative teaching discussions: “if the red zone (no-go) is marked out and the surgeon asks the trainee to go in the red zone or themselves approach the red zone, that becomes an automatic discussion point.” Another trainee noted “the capability to turn the AI model off so the learners can see what safe and unsafe zones actually looks like on normal anatomy is really helpful.” Third, participants also noted value in post-operative video review for feedback and coaching, where the annotated anatomy could clarify dissection decisions for trainees.

Perceived Risks of Intraoperative AI Decision Support

Surgeons, non-medical staff, and patients described a wariness toward the technology due to the potential for overreliance, leading to poor decision making. One surgeon questioned, “would you be falsely reassured by this technology that you're in the Go Zone despite not actually being there and then causing a bile duct injury?” Surgeons emphasized the importance of positioning AI as an adjunct to surgical judgment rather than a replacement. One surgeon described the importance of appropriate framing: "it's not telling you how to do an operation, and it's not making a judgment for you. It's an added layer of safety." Participants highlighted the importance of surgeons learning the algorithm's limitations and recognizing situations when clinical judgment should override AI recommendations.

Participants also expressed concern about cognitive overload from introducing additional technology into the OR. One surgeon stated, "the goal is not to distract the surgeon, because they still bear the primary responsibility for the patient’s outcome." Another surgeon noted that if the technology presents excessive clinically irrelevant information, "it's actually not helping and could be perceived as a distraction more often than not." Additionally, concerns about medicolegal implications of AI use were also raised. Views on liability varied, with some believing AI use would provide protection in litigation and others fearing it could increase liability in the event of adverse outcomes.

Theme 1: Intuitive Design

Participants emphasized that interface design and ease of setup directly influence technology adoption, with well-designed systems reducing training burden and facilitating integration into existing workflows. One surgeon stated, “if the integration is not smooth, barriers to adoption increase significantly.” A biomedical engineer explained: “if you can make it as easy as plug ‘A’ into ‘A’ and ‘B’ into ‘B’ and everything works, then everything will go smoothly.” Participants suggested that interface elements such as labeling, color-coding, and information density should reinforce appropriate use of AI as a decision-making adjunct rather than a directive authority. Participants also emphasized that achieving intuitive design requires iterative usability testing in both simulation and clinical environments. One surgeon stated, “usability studies analyzing the workflow, studying the environment and how these cases progress are essential to optimize product design and increase adoption.”

Theme 2: Adequate Training

Participants emphasized that effective training must be tailored to professional roles and should communicate the purpose and value of the technology. One participant stated, "When new technologies are introduced without sufficient context, learners understandably question their relevance. It is therefore critical to demonstrate how these tools concretely advance patient care."

Surgeons described requiring conceptual understanding of algorithm function and limitations to appropriately integrate AI into clinical decision-making. One surgeon stated, “I need to understand how the algorithm was trained, which datasets it was trained on, and how it's been validated.” This algorithmic literacy enables surgeons to interpret predictions accurately, recognize scenarios where AI may be unreliable, and make informed decisions when AI recommendations conflict with clinical judgment.

Nurses anticipated primary responsibility for equipment setup and frontline troubleshooting, necessitating practical, hands-on training. They described the need to practice setting up and troubleshooting the equipment outside of the OR to gain confidence. They also emphasized the value of quick-reference materials: "Physical tipsheets to refer back to the troubleshooting steps would be beneficial." Similarly, biomedical engineers described needing advanced technical knowledge for system maintenance and complex problem-solving. One engineer stated, “Troubleshooting tips for common technical failure reasons including cables or hardware components are essential.” Inefficiencies related to inadequate training and resources may result in underutilization of otherwise useful technology.

Theme 3: Maximizing Adaptability

Participants emphasized that successful implementation requires adaptable technology capable of accommodating diverse user preferences and practice contexts rather than a standardized, one-size-fits-all approach. This need for adaptability manifested across three domains: use cases, display design, and physical infrastructure.

Surgeons described fundamentally different anticipated workflows. Some envisioned intermittent use: "toggle on and off to ensure that you're still in the ‘Go” zone." Others anticipated continuous monitoring with the overlay persistently displayed. Additional use cases included automatic alert-triggered displays that appear only when approaching high-risk zones, used exclusively as an educational tool for trainees rather than for clinical decision support, and split-display configurations where trainees view the overlay while supervising surgeons see the unaugmented field. One surgeon described, "the trainee doing the surgery can look at the screen with the overlay but someone teaching it could see it on the regular display and see how they are doing." Display preferences also varied substantially. Some surgeons preferred high-confidence-only overlays: "I'd prefer to see the overlay only at 90% or higher confidence.” Others wanted graduated confidence displays with intermediate warning zones labelled as a “yellow” zone, prompting them to slow down during the dissection. Ideally, the software would be designed to accommodate multiple preferences.

Nurses and biomedical engineers identified variability in OR physical space and availability of technology as a barrier. Providing a second display was an attractive option when compatibility across vendors could be an issue. However, “adding another display introduces logistical challenges related to transport, setup, and physical space constraints.” Ideally, the technology could be integrated with existing equipment, but it was acknowledged that this was not always possible.

Theme 4: Ongoing Support

Participants described the need for phased implementation support that is intensive during initial deployment. One nurse stated, "the vendor/ researcher should be present in the OR to help support arising issues as the team uses the technology for the first time." This on-site presence allows for real-time troubleshooting of technical issues and observation of workflow challenges not anticipated during training. As implementation progresses, support can transition from continuous on-site presence to on-call availability and eventually to standard institutional IT support channels with clear escalation pathways for complex issues. Beyond initial deployment support, participants emphasized the importance of sustained responsiveness to user feedback. One surgeon stated, "be open-minded to feedback as the technology matures. Soliciting feedback and then being responsive to it helps adoption." This ongoing engagement enables iterative refinement of both the technology and training materials based on real-world implementation challenges and evolving user needs.

Theme 5: Fostering Buy-In

Participants emphasized that successful implementation depends on cultivating an organizational culture that supports technology adoption and engaging key stakeholders who can champion the change. Surgeons described the need to establish a professional environment where using AI decision support is viewed as aligned with surgical excellence rather than a sign of inadequacy. One surgeon stated, “sometimes we hesitate to ask for help because it risks appearing inadequate; a supportive culture is instead open to ideas, inquisitive, and committed to improvement.” Participants identified trust as foundational to adoption, with surgeons requiring transparent evidence about the algorithm before integrating recommendations into clinical practice.

Finally, participants emphasized selecting early adopters as champions. A surgeon warned, “failing to choose the right first adopters (surgeons) is a major barrier to sustainable adoption.” Champions who are involved early develop a sense of ownership and can advocate for the technology among colleagues, provide peer-to-peer mentorship, and serve as conduits for feedback between end-users and technology developers.

Implementation Framework and Checklist

This study integrates expert opinion from various stakeholders to explore the perceived risks and benefits of intraoperative AI decision support and to elicit the aforementioned key themes that were coalesced into the SMART-OR implementation framework with actionable checklists (Fig. 2).

Fig. 2

Three-Phase Implementation Framework and Actionable Checklist for Intraoperative Artificial Intelligence

Discussion

While AI for intraoperative decision support has advanced rapidly, translation into routine clinical practice remains limited. Existing reporting and evaluation frameworks include the IDEAL-Robotics framework (Idea, Development, Exploration, Assessment, Long-term study for surgical robotics) which provides stage-gated evaluation of innovation [21]; DECIDE-AI (Developmental and Exploratory Clinical Investigations of Decision-support systems driven by AI) which standardizes early-phase evaluation [22]; CPI-AI (Clinical Practice Integration for Artificial Intelligence) which emphasizes safe clinical integration [23]; and Consolidated Standards of Reporting Trials – Artificial Intelligence extension (CONSORT-AI) which extend reporting standards for AI-driven trials [24, 25]. Collectively, these frameworks clarify whether a surgical AI system works safely and effectively. However, they provide limited guidance on how to implement AI systems sustainably in complex, high-stakes surgical environments.

In contrast, the broader field of digital health implementation has matured over the past two decades, producing robust, theory-driven frameworks such as the CFIR, the Non-Adoption, Abandonment, Scale-up, Spread, and Sustainability (NASSS) framework [26], and the Expert Recommendations for Implementing Change (ERIC) taxonomy [13]. However, they require extensive interpretation for intraoperative use, where decision-making occurs in seconds, sterility constraints limit physical interaction with devices, and coordination among multidisciplinary teams is both dynamic and hierarchical. The SMART-OR framework directly closes the translational gap between high-level implementation science and the realities of intraoperative AI adoption. Its novelty lies in merging AI-specific safeguards (e.g., explainability, automation-bias mitigation, post-deployment drift monitoring) with OR constraints such as limited physical space, latency, and team coordination. A comparison with existing frameworks and their limitations is presented in Table 2. Additionally, the checklist is intended to be used as a practical guide when transitioning from research to intraoperative implementation.

Table 2

Translating General Implementation Science Principles to Surgical AI-Specific Guidance Comparison of implementation guidance provided by established frameworks (Consolidated Framework for Implementation Research [CFIR], Expert Recommendations for Implementing Change [ERIC], Replicating Effective Programs [REP]) versus context-specific guidance developed in this study for intraoperative artificial intelligence.
Implementation Domain	Generic Framework Guidance	SMART-OR Framework
Training Staff	Conduct educational meetings (ERIC) Provide training (CFIR)	● Surgeons: Algorithm architecture, training data, validation results, limitations, decision frameworks when AI conflicts with judgment ● Nurses: Hands-on setup practice outside OR, troubleshooting decision trees, quick-reference cards for intraoperative use ● Engineers: System architecture, network requirements, hardware maintenance, advanced troubleshooting (XX hours)
Local Adaptation	Assess for readiness and identify barriers (CFIR) Allow for local adaptation (REP)	Specific adaptations identified: ● Display modes: manual toggle, automatic alert-triggered, continuous, teaching-only, post-op review ● Confidence thresholds: user-customizable (90%) ● Physical setup: integrated vs. separate display options ● Vendor compatibility assessment and workarounds
Ongoing Support	Provide ongoing consultation (ERIC) Develop a formal implementation blueprint (REP)	Phased support protocol: ● Weeks 1–2: On-site presence for all cases ● Weeks 3–4: Rapid response available within minutes ● Weeks 5–8: Scheduled check-ins plus as needed ● Months 3+: Standard IT channels with clear escalation pathway
Technology Risks	Address challenges to using the intervention (CFIR)	AI-specific risk mitigation: ● Training explicitly addresses overreliance and deskilling ● Interface labels prompt critical thinking vs. commands ● Cognitive load reduction through usability testing ● Clarity on when to override AI recommendations ● Strategies for when AI and clinical judgment diverge
Fostering Adoption	Identify and prepare champions (ERIC) Assess for organizational culture (CFIR)	OR culture-specific strategies: ● Frame as professional standard, not remediation ● Leadership visible use and endorsement ● Champion characteristics: clinically respected, patient with iteration ● Evidence package: broad training data, expert labeling, performance metrics, failure modes transparently reported
Physical Integration	Not addressed in general frameworks	OR-specific requirements: ● Sterile field compatibility ● Physical space constraints varying by OR ● Cable/connection specifications ● Latency requirements for real-time use ● Point-of-use troubleshooting materials
Patient Perspectives	Engage patients (CFIR)	Nuanced patient considerations: ● Develop disclosure language surgeons can customize

The patterns we identified build on three decades of electronic medical record (EMR) implementation. Kaiser Permanente's successful $4 billion HealthConnect deployment demonstrated that phased rollout, physician leadership, extensive training, and workflow integration enabled large-scale health IT adoption [27]. Conversely, the United Kingdom's £10 billion National Programme for Information Technology (IT) failed due to top-down mandates, insufficient end-user engagement, one-size-fits-all design, and underestimated complexity, precisely the barriers our stakeholders identified as critical to avoid [28]. Three lessons apply directly. First, complexity across multiple domains predicts failure. Our identification of diverse user preferences signals inherent complexity requiring adaptable systems rather than standardized solutions. Second, physician engagement from inception is non-negotiable. Our emphasis on identifying surgeon champions early and framing AI as professional enhancement rather than remediation aligns with this principle [13, 29–31]. A consistent facilitator was the explicit positioning of intraoperative AI as an adjunct for surgical decision-making, reinforcing team trust. Third, training investment determines adoption. The consistent theme across stakeholder groups highlighting the importance of comprehensive training in technology uptake, mirrors findings from successful EMR implementations. These parallels suggest that surgical AI faces fundamentally similar organizational challenges to other health IT tools, with the additional complexity of real-time intraoperative use.

Our findings also highlight barriers to intraoperative AI adoption. Overreliance on AI emerged as a central concern. Surgeons identified increased risk through automation bias and deskilling unless design and training explicitly teach critical appraisal of AI outputs [32]. Unlike AI in radiology, intraoperative tools must deliver actionable, non-disruptive guidance in real time, imposing stringent requirements for latency, accuracy, and interface ergonomics [33]. Implementation must also account for tightly interdependent OR teams (surgeons, anesthesiologists, nurses, technicians) necessitating coordinated, role-specific training and clear accountability for AI-related issues. Finally, medico-legal ambiguity remains a barrier; aligning policy and guidance to recognize AI as a decision-support adjunct will be crucial to responsible adoption.

There are several limitations to this study. First, all participants were drawn from a single North American academic institution, which may limit the transferability of our findings to other practice settings. Future studies should include global stakeholders. Second, several sub-themes from this study require further exploration, including legislation governing intraoperative AI use, impacts on medical litigation, AI-introduced distractions, and optimal methods for usability testing. Lastly, purposive sampling may have introduced selection bias toward individuals with greater AI exposure, despite efforts to include both AI enthusiasts and skeptics.

Conclusion

This study describes the first stakeholder-informed SMART-OR implementation framework specifically designed to bridge the gap between development and implementation of intraoperative AI decision support. By systematically eliciting perspectives from surgeons, surgical trainees, nurses, biomedical engineers, and patients we identified five core implementation requirements and synthesized these into an actionable checklist. While validation through implementation is essential and ongoing, this formative work establishes a roadmap for researchers, clinicians, and healthcare organizations seeking to responsibly translate intraoperative AI from development to clinical practice.

Declarations

Ethics approval and consent to participate:

• This study was approved by the University of Health Network Research Ethics Board (Protocol #23-5170).

All participants provided written informed consent prior to participation.

The study was conducted in accordance with the Declaration of Helsinki and adhered to Good Clinical Practice guidelines.

Consent for publication:

• Not applicable. This manuscript does not contain any individual person's data in any form (including individual details, images, or videos). All data are presented in aggregate form only.

Availability of data and material:

The qualitative datasets generated and analyzed during this study are not publicly available due to participant privacy and confidentiality requirements outlined in the ethics approval and informed consent documents. Aggregate data supporting the findings are available from the corresponding author upon reasonable request and subject to appropriate data sharing agreements.

Competing interest:

• All authors declare no financial or non-financial competing interests.

Funding:

This work received no funding

Author Contribution

HL: Conceptualization, study design, recruitment, data collection, data analysis, framework generation, preparation of tables and figures, manuscript preparationDK: preparation of tables and figures, manuscript writing, manuscript editingMUK: Data collection, data analysis, thematic segmentation, manuscript writing, manuscript editingJB: manuscript editingCM: Conceptualization, methodology, data verification, manuscript editingRLC: manuscript editingTJL: manuscript editingMN: manuscript editingWS: manuscript editingAM: Conceptualization, supervision, framework generation, preparation of tables and figures, manuscript preparation

DK: preparation of tables and figures, manuscript writing, manuscript editing

MUK: Data collection, data analysis, thematic segmentation, manuscript writing, manuscript editing

JB: manuscript editing

CM: Conceptualization, methodology, data verification, manuscript editing

RLC: manuscript editing

TJL: manuscript editing

MN: manuscript editing

WS: manuscript editing

AM: Conceptualization, supervision, framework generation, preparation of tables and figures, manuscript preparation

Acknowledgement

None

Electronic Supplementary Material

Below is the link to the electronic supplementary material

Supplementary Material 1

References

Kewalramani D, Loftus TJ, Coleman JR, Kaafarani H, Narayan M. Using AI to bridge global surgical gaps: high tech, high impact. The Lancet [Internet]. 2024 May [cited 2025 Dec 11];403(10438):1746–7. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0140673623027538

Laplante S, Madani A. Artificial intelligence in surgery. In: Artificial Intelligence in Clinical Practice [Internet]. Elsevier; 2024 [cited 2025 Dec 11]. p. 211–6. Available from: https://linkinghub.elsevier.com/retrieve/pii/B978044315688500019X

Khalid MU, Laplante S, Madani A. Machines with vision for intraoperative guidance during gastrointestinal cancer surgery. Front Med (Lausanne). 2022;9:1025382.

Hashimoto DA, Rosman G, Rus D, Meireles OR. Artificial Intelligence in Surgery: Promises and Perils. Annals of Surgery [Internet]. 2018 July [cited 2025 Dec 11];268(1):70–6. Available from: https://journals.lww.com/00000658-201807000-00013

Kewalramani D, Loftus TJ, Mayol J, Narayan M. Artificial intelligence in surgery: a global balancing act. British Journal of Surgery [Internet]. 2024 Mar 2 [cited 2025 Dec 11];111(3):znae062. Available from: https://academic.oup.com/bjs/article/doi/10.1093/bjs/znae062/7634459

Morris ZS, Wooding S, Grant J. The answer is 17 years, what is the question: understanding time lags in translational research. J R Soc Med [Internet]. 2011 Dec [cited 2025 Dec 11];104(12):510–20. Available from: https://journals.sagepub.com/doi/10.1258/jrsm.2011.110180

Finkelstein J, Gabriel A, Schmer S, Truong TT, Dunn A. Identifying Facilitators and Barriers to Implementation of AI-Assisted Clinical Decision Support in an Electronic Health Record System. J Med Syst [Internet]. 2024 Sept 18 [cited 2025 Dec 11];48(1):89. Available from: https://link.springer.com/10.1007/s10916-024-02104-9

Hassan M, Kushniruk A, Borycki E. Barriers to and Facilitators of Artificial Intelligence Adoption in Health Care: Scoping Review. JMIR Hum Factors [Internet]. 2024 Aug 29 [cited 2025 Dec 11];11:e48633. Available from: https://humanfactors.jmir.org/2024/1/e486339.

Petersson L, Larsson I, Nygren JM, Nilsen P, Neher M, Reed JE, et al. Challenges to implementing artificial intelligence in healthcare: a qualitative interview study with healthcare leaders in Sweden. BMC Health Serv Res [Internet]. 2022 Dec [cited 2025 Dec 11];22(1):850. Available from: https://bmchealthservres.biomedcentral.com/articles/10.1186/s12913-022-08215-8

10.

Nair M, Svedberg P, Larsson I, Nygren JM. A comprehensive overview of barriers and strategies for AI implementation in healthcare: Mixed-method design. Six S, editor. PLoS ONE [Internet]. 2024 Aug 9 [cited 2025 Dec 11];19(8):e0305949. Available from: https://dx.plos.org/10.1371/journal.pone.0305949

11.

Damschroder LJ, Aron DC, Keith RE, Kirsh SR, Alexander JA, Lowery JC. Fostering implementation of health services research findings into practice: a consolidated framework for advancing implementation science. Implementation Sci [Internet]. 2009 Dec [cited 2025 Dec 11];4(1):50. Available from: http://implementationscience.biomedcentral.com/articles/10.1186/1748-5908-4-50

12.

Kilbourne AM, Neumann MS, Pincus HA, Bauer MS, Stall R. Implementing evidence-based interventions in health care: application of the replicating effective programs framework. Implementation Sci [Internet]. 2007 Dec [cited 2025 Dec 11];2(1):42. Available from: http://implementationscience.biomedcentral.com/articles/10.1186/1748-5908-2-42

13.

Powell BJ, Waltz TJ, Chinman MJ, Damschroder LJ, Smith JL, Matthieu MM, et al. A refined compilation of implementation strategies: results from the Expert Recommendations for Implementing Change (ERIC) project. Implementation Sci [Internet]. 2015 Dec [cited 2025 Dec 11];10(1):21. Available from: http://implementationscience.biomedcentral.com/articles/10.1186/s13012-015-0209-1

14.

Khalid MU, Laplante S, Masino C, Alseidi A, Jayaraman S, Zhang H, et al. Use of artificial intelligence for decision-support to avoid high-risk behaviors during laparoscopic cholecystectomy. Surg Endosc [Internet]. 2023 Dec [cited 2025 Dec 11];37(12):9467–75. Available from: https://link.springer.com/10.1007/s00464-023-10403-4

15.

Peters DH, Tran NT, Adam T. Implementation research in health: a practical guide. Geneva: Alliance for Health Policy and Systems Research, World Health Organization; 2014. 67 p.

16.

Braun V, Clarke V. Using thematic analysis in psychology. Qualitative Research in Psychology [Internet]. 2006 Jan [cited 2025 Dec 11];3(2):77–101. Available from: https://www.tandfonline.com/doi/full/10.1191/1478088706qp063oa

17.

Laplante S, Namazi B, Kiani P, Hashimoto DA, Alseidi A, Pasten M, et al. Validation of an artificial intelligence platform for the guidance of safe laparoscopic cholecystectomy. Surg Endosc [Internet]. 2023 Mar [cited 2025 Dec 11];37(3):2260–8. Available from: https://link.springer.com/10.1007/s00464-022-09439-9

18.

Madani A, Namazi B, Altieri MS, Hashimoto DA, Rivera AM, Pucher PH, et al. Artificial Intelligence for Intraoperative Guidance: Using Semantic Segmentation to Identify Surgical Anatomy During Laparoscopic Cholecystectomy. Annals of Surgery [Internet]. 2022 Aug [cited 2025 Dec 11];276(2):363–9. Available from: https://journals.lww.com/10.1097/SLA.0000000000004594

19.

Morse JM. The Significance of Saturation. Qual Health Res [Internet]. 1995 May [cited 2025 Dec 11];5(2):147–9. Available from: https://journals.sagepub.com/doi/10.1177/104973239500500201

20.

Creswell JW. Research design: qualitative, quantitative, and mixed methods approaches. 3rd ed. Thousand Oaks, Calif: Sage Publications; 2009. 260 p.

21.

Marcus HJ, Ramirez PT, Khan DZ, Layard Horsfall H, Hanrahan JG, Williams SC, et al. The IDEAL framework for surgical robotics: development, comparative evaluation and long-term monitoring. Nat Med [Internet]. 2024 Jan [cited 2025 Dec 11];30(1):61–75. Available from: https://www.nature.com/articles/s41591-023-02732-7.

22.

Vasey B, Nagendran M, Campbell B, Clifton DA, Collins GS, Denaxas S, et al. Reporting guideline for the early-stage clinical evaluation of decision support systems driven by artificial intelligence: DECIDE-AI. Nat Med [Internet]. 2022 May [cited 2025 Dec 11];28(5):924–33. Available from: https://www.nature.com/articles/s41591-022-01772-9

23.

Farrow L, Meek D, Leontidis G, Campbell M, Harrison E, Anderson L. The Clinical Practice Integration of Artificial Intelligence (CPI-AI) framework: a proposed application of IDEAL principles to artificial intelligence applications in trauma and orthopaedics. Bone Joint Res [Internet]. 2024 Sept 18 [cited 2025 Dec 11];13(9):507–12. Available from: https://boneandjoint.org.uk/doi/10.1302/2046-3758.139.BJR-2024-0135.R1

24.

Cruz Rivera S, Liu X, Chan AW, Denniston AK, Calvert MJ, Ashrafian H, et al. Guidelines for clinical trial protocols for interventions involving artificial intelligence: the SPIRIT-AI extension. The Lancet Digital Health [Internet]. 2020 Oct [cited 2025 Dec 11];2(10):e549–60. Available from: https://linkinghub.elsevier.com/retrieve/pii/S2589750020302193

25.

Liu X, Cruz Rivera S, Moher D, Calvert MJ, Denniston AK, Ashrafian H, et al. Reporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT-AI extension. The Lancet Digital Health [Internet]. 2020 Oct [cited 2025 Dec 11];2(10):e537–48. Available from: https://linkinghub.elsevier.com/retrieve/pii/S2589750020302181

26.

Greenhalgh Trisha, Abimbola Seye. The NASSS Framework – A Synthesis of Multiple Theories of Technology Implementation. In: Studies in Health Technology and Informatics [Internet]. IOS Press; 2019 [cited 2025 Dec 11]. Available from: https://www.medra.org/servlet/aliasResolver?alias=iospressISBN&isbn=978-1-61499-990-4&spage=193&doi=10.3233/SHTI190123

27.

Wheatley B. Transforming care delivery through health information technology. Perm J. 2013;17(1):81–6.

28.

Justinia T. The UK’s National Programme for IT: Why was it dismantled? Health Serv Manage Res [Internet]. 2017 Feb [cited 2025 Dec 11];30(1):2–9. Available from: https://journals.sagepub.com/doi/10.1177/0951484816662492

29.

Gotlib Conn L, McKenzie M, Pearsall EA, McLeod RS. Successful implementation of an enhanced recovery after surgery programme for elective colorectal surgery: a process evaluation of champions’ experiences. Implementation Sci [Internet]. 2015 Dec [cited 2025 Dec 11];10(1):99. Available from: https://implementationscience.biomedcentral.com/articles/10.1186/s13012-015-0289-y

30.

Khalid MU, Mac A, Biderman M, Errett L, Sriharan A. Partnering to build surgical capacity in low-resource settings: a qualitative study of Canadian global surgeons. BMJ Open [Internet]. 2023 Mar [cited 2025 Dec 11];13(3):e070148. Available from: https://bmjopen.bmj.com/lookup/doi/10.1136/bmjopen-2022-070148

31.

Latham SC. The Importance of a Physician Champion in Implementing Practice Change. American Journal of Infection Control [Internet]. 2006 June [cited 2025 Dec 11];34(5):E48. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0196655306007395

32.

Kewalramani D, Jawa RS, Martin CA, Gosain A, Wan D, Varghese TK, et al. Position statement from the society of University surgeons, surgical education committee: Artificial intelligence in surgical training for medical students, residents, and fellows. Surgery [Internet]. 2026 Feb [cited 2025 Dec 11];190:109849. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0039606025007019

33.

Rockall AG, Shelmerdine SC, Chen M. AI and ML in radiology: Making progress. Clinical Radiology [Internet]. 2023 Feb [cited 2025 Dec 11];78(2):81–2. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0009926022007085

Figure and Table Legends

Figure 1. Thematic Framework for Implementing Intraoperative Artificial Intelligence Decision Support Systems

Figure 2. Three-Phase Implementation Framework and Actionable Checklist for Intraoperative Artificial Intelligence

Table 1. Demographic and Practice Characteristics of Operating Room Personnel (Phase 1)

Table 2. Translating General Implementation Science Principles to Surgical AI-Specific Guidance

Comparison of implementation guidance provided by established frameworks (Consolidated Framework for Implementation Research [CFIR], Expert Recommendations for Implementing Change [ERIC], Replicating Effective Programs [REP]) versus context-specific guidance developed in this study for intraoperative artificial intelligence.

Yes