Collected molecules will appear here. Add from search or explore.
Automated disambiguation of ambiguous biomedical and clinical terms using word embedding vectors.
Defensibility
stars
14
forks
5
The 'yarn' project is a legacy research artifact from approximately 2014-2015, utilizing Word2Vec-era embeddings for clinical word sense disambiguation (WSD). With only 14 stars and no activity in nearly a decade, it represents a 'digital fossil' in the NLP space. Historically, it likely contributed to the transition from rule-based clinical NLP to vector-based methods, but it has been entirely superseded by Transformer-based models (BioBERT, ClinicalBERT) and modern LLMs which handle context and disambiguation inherently better without the need for static embedding lookups. From a competitive standpoint, tools like MedCAT, scispaCy, or specialized clinical LLMs (Med-PaLM, GPT-4 with medical prompting) provide orders of magnitude better performance and easier integration. The project lacks any modern defensive moat, such as a unique dataset, active community, or superior accuracy, and is essentially at 100% obsolescence risk.
TECH STACK
INTEGRATION
library_import
READINESS