Segment anything model for medical images?

Y Huang, X Yang, L Liu, H Zhou, A Chang, X Zhou… - Medical Image …, 2024 - Elsevier
Abstract The Segment Anything Model (SAM) is the first foundation model for general image
segmentation. It has achieved impressive results on various natural image segmentation …

Ophnet: A large-scale video benchmark for ophthalmic surgical workflow understanding

M Hu, P Xia, L Wang, S Yan, F Tang, Z Xu… - … on Computer Vision, 2025 - Springer
Surgical scene perception via videos is critical for advancing robotic surgery, telesurgery,
and AI-assisted surgery, particularly in ophthalmology. However, the scarcity of diverse and …

[HTML][HTML] Methods and datasets for segmentation of minimally invasive surgical instruments in endoscopic images and videos: A review of the state of the art

T Rueckert, D Rueckert, C Palm - Computers in Biology and Medicine, 2024 - Elsevier
In the field of computer-and robot-assisted minimally invasive surgery, enormous progress
has been made in recent years based on the recognition of surgical instruments in …

Skit: a fast key information video transformer for online surgical phase recognition

Y Liu, J Huo, J Peng, R Sparks… - Proceedings of the …, 2023 - openaccess.thecvf.com
This paper introduces SKiT, a fast Key information Transformer for phase recognition of
videos. Unlike previous methods that rely on complex models to capture long-term temporal …

[HTML][HTML] Lovit: Long video transformer for surgical phase recognition

Y Liu, M Boels, LC Garcia-Peraza-Herrera… - Medical Image …, 2025 - Elsevier
Online surgical phase recognition plays a significant role towards building contextual tools
that could quantify performance and oversee the execution of surgical workflows. Current …

CholecTriplet2022: Show me a tool and tell me the triplet—An endoscopic vision challenge for surgical action triplet detection

CI Nwoye, T Yu, S Sharma, A Murali, D Alapatt… - Medical Image …, 2023 - Elsevier
Formalizing surgical activities as triplets of the used instruments, actions performed, and
target anatomies is becoming a gold standard approach for surgical activity modeling. The …

[HTML][HTML] Digital twins as a unifying framework for surgical data science: the enabling role of geometric scene understanding

H Ding, L Seenivasan, BD Killeen, SM Cho… - Artificial Intelligence …, 2024 - oaepublish.com
Surgical data science is devoted to enhancing the quality, safety, and efficacy of
interventional healthcare. While the use of powerful machine learning algorithms is …

Hecvl: Hierarchical video-language pretraining for zero-shot surgical phase recognition

K Yuan, V Srivastav, N Navab, N Padoy - International Conference on …, 2024 - Springer
Natural language could play an important role in developing generalist surgical models by
providing a broad source of supervision from raw texts. This flexible form of supervision can …

Surgformer: Surgical transformer with hierarchical temporal attention for surgical phase recognition

S Yang, L Luo, Q Wang, H Chen - International Conference on Medical …, 2024 - Springer
Existing state-of-the-art methods for surgical phase recognition either rely on the extraction
of spatial-temporal features at a short-range temporal resolution or adopt the sequential …

GMM-based Heuristic Decision Framework for Safe Automated Laparoscope Control

B Li, Y Lu, W Chen, B Lu, F Zhong… - IEEE Robotics and …, 2024 - ieeexplore.ieee.org
Automated laparoscope field of view (FoV) control in minimal invasive surgery (MIS) poses
challenges, as existing solutions failed to address dynamic surgical FoV requirements …