Collected molecules will appear here. Add from search or explore.
Fine-tuning and applying Genome Foundation Models (GFMs) to predict the pathogenicity of DNA sequences from raw genomic data.
Defensibility
stars
6
forks
3
Patho-LM represents a common pattern in AI for Science: taking a pre-trained foundation model (like DNABERT or Nucleotide Transformer) and fine-tuning it for a specific clinical or biological task (pathogenicity prediction). While the work was presented at reputable workshops (MLCB 2024, ICML AI4Science 2024), the repository itself serves primarily as a static research artifact for those papers rather than a living software project. With only 6 stars and 3 forks over 1.5 years, it lacks developer adoption or community momentum. The technical moat is thin because the 'foundation' is typically a third-party model, and the pathogenicity 'head' is a standard downstream task. Frontier labs and specialized biotech AI firms (e.g., DeepMind with AlphaMissense, EvolutionaryScale with ESM, and InstaDeep) are aggressively releasing updated models that often outperform these specific fine-tuned versions or include them as part of a broader suite of zero-shot capabilities. Consequently, the displacement risk is very high as new, more capable genomic architectures (like Evo or Caduceus) emerge and render single-task fine-tuned models obsolete.
TECH STACK
INTEGRATION
reference_implementation
READINESS