Collected molecules will appear here. Add from search or explore.
200 molecules loaded — scroll down to load more
Showing 200 of 200 molecules
| Corpus | Title | Score | Threat | Novelty | Type | Traction | |
|---|---|---|---|---|---|---|---|
| GitHub | huggingface/transformers Model definition, loading, and fine-tuning framework for transformer-based architectures across text, vision, audio, and multimodal domains with unified APIs for inference and training. | 10 | HIGH | incremental | component | 159,006 | |
| GitHub | NVIDIA/TensorRT-LLM Official NVIDIA high-performance inference optimization library for Large Language Models on NVIDIA hardware, providing advanced kernels, quantization, and orchestration. | 10 | LOW | novel_combination | framework | 13,319 | |
| GitHub | PX4/PX4-Autopilot Open-source autopilot flight control software stack for multirotor, fixed-wing, and VTOL unmanned aerial vehicles (UAVs), providing real-time control algorithms, sensor fusion, navigation, and hardware abstraction across diverse embedded platforms. | 9 | LOW | incremental | framework | 11,461 | |
| GitHub | apache/airflow Workflow orchestration platform for authoring, scheduling, and monitoring data pipelines and ETL processes | 9 | MEDIUM | incremental | framework | 44,937 | |
| GitHub | autowarefoundation/autoware End-to-end open-source software stack for autonomous driving, providing modules for sensing, localization, perception, planning, and control. | 9 | LOW | novel_combination | framework | 11,344 | |
| GitHub | bytecodealliance/wasmtime Production-grade WebAssembly runtime with JIT compilation, sandboxing, and WASI support for executing WASM modules across multiple platforms | 9 | LOW | incremental | framework | 17,866 | |
| GitHub | ros2/ros2 Meta-operating system and middleware for robot development, providing communication, coordination, and tooling across distributed robot systems | 9 | LOW | incremental | framework | 5,312 | |
| GitHub | triton-inference-server/server Production-grade inference serving platform for deploying and managing machine learning models across cloud and edge environments with multi-backend support, batching, and dynamic loading. | 9 | MEDIUM | incremental | application | 10,527 | |
| GitHub | modelcontextprotocol/go-sdk Official Go implementation of the Model Context Protocol (MCP) for building interoperable AI clients and servers that connect LLMs to data sources and tools. | 9 | LOW | novel_combination | framework | 4,316 | |
| GitHub | modelcontextprotocol/modelcontextprotocol Model Context Protocol: A standard specification and reference implementation for connecting AI models to external data sources, tools, and context via a unified protocol | 9 | LOW | novel_combination | framework | 7,745 | |
| GitHub | vllm-project/vllm High-throughput, memory-efficient LLM inference and serving engine with optimized batching, KV-cache management, and multi-GPU/hardware support | 9 | HIGH | novel_combination | framework | 75,657 | |
| GitHub | langchain-ai/langchain LLM application framework providing abstractions for chaining language models, memory, retrieval, and agents with pluggable integrations across 100+ external services | 9 | HIGH | novel_combination | framework | 132,702 | |
| GitHub | langchain-ai/langgraph Framework for building stateful, multi-step agent applications using a graph-based execution model with built-in persistence, streaming, and human-in-the-loop capabilities. | 8 | HIGH | novel_combination | framework | 28,648 | |
| GitHub | open-edge-platform/anomalib A unified framework for deep-learning-based visual anomaly detection, providing SOTA algorithms, benchmarking tools, and deployment pipelines for industrial inspection. | 8 | MEDIUM | novel_combination | framework | 5,604 | |
| GitHub | microsoft/graphrag Graph-based Retrieval-Augmented Generation (RAG) system that extracts entities and relationships from documents to build knowledge graphs for improved LLM context retrieval | 8 | HIGH | novel_combination | component | 32,039 | |
| GitHub | ros-navigation/navigation2 Production-grade ROS 2 navigation framework providing autonomous robot path planning, localization, and costmap-based obstacle avoidance for mobile robotics | 8 | LOW | incremental | framework | 4,113 | |
| GitHub | bytedance/deer-flow Long-horizon agent orchestration framework enabling multi-step reasoning, code generation, and task execution through sandboxed environments, persistent memory, tool integration, and hierarchical subagent coordination. | 8 | HIGH | novel_combination | framework | 57,861 | |
| GitHub | sgl-project/sglang High-performance serving framework for large language models and multimodal models with optimized inference execution and structured generation capabilities. | 8 | HIGH | novel_combination | framework | 25,525 | |
| GitHub | DependencyTrack/dependency-track Software supply chain risk management and component analysis platform for identifying vulnerabilities, license compliance, and dependencies across software projects | 8 | MEDIUM | incremental | application | 3,730 | |
| GitHub | langgenius/dify Production-ready low-code platform for building, deploying, and managing AI agent workflows and LLM applications with visual builder, RAG capabilities, and multi-model support | 8 | HIGH | reimplementation | application | 136,679 | |
| GitHub | NVIDIA/Isaac-GR00T Foundation model and inference framework for generalist robot control and perception, enabling multi-modal learning across diverse robotic morphologies and tasks | 8 | HIGH | novel_combination | framework | 6,596 | |
| GitHub | WasmEdge/WasmEdge Lightweight, high-performance WebAssembly runtime for edge computing, serverless, and IoT with extensible plugin architecture | 8 | MEDIUM | novel_combination | framework | 10,557 | |
| GitHub | microsoft/mcp Official catalog and reference implementations of Model Context Protocol (MCP) servers for standardizing AI model access to tools, data sources, and external systems | 8 | HIGH | novel_combination | framework | 2,923 | |
| GitHub | microsoft/agent-framework A multi-agent orchestration framework that enables complex task solving through automated conversations between multiple customizable AI agents, supporting both Python and .NET ecosystems. | 8 | MEDIUM | novel_combination | framework | 9,129 | |
| GitHub | bytecodealliance/wasm-micro-runtime Lightweight WebAssembly runtime optimized for embedded systems, IoT, and resource-constrained environments. Provides a compact, portable alternative to full WASM runtimes with JIT, interpreter, and AOT compilation modes. | 8 | MEDIUM | incremental | framework | 5,879 | |
| GitHub | autowarefoundation/autoware_universe Modular, production-grade autonomous driving software stack with perception, planning, control, and simulation components for real-world and simulated vehicle deployment | 8 | MEDIUM | reimplementation | framework | 1,568 | |
| GitHub | langchain-ai/langgraphjs Orchestration framework for building stateful, multi-agent applications using graph-based logic with support for cycles and persistence. | 8 | MEDIUM | novel_combination | framework | 2,755 | |
| GitHub | Tencent/ncnn High-performance neural network inference framework optimized for mobile and embedded platforms with minimal dependencies and extreme efficiency | 8 | MEDIUM | incremental | framework | 23,054 | |
| GitHub | pydantic/pydantic-ai Framework for building AI agents with structured outputs, tool use, and multi-model support via Pydantic validation | 8 | MEDIUM | novel_combination | framework | 16,164 | |
| GitHub | tuya/TuyaOpen Cross-platform hardware abstraction layer and SDK for building AI-integrated IoT devices across multiple chip architectures (ESP32, Tuya-specific silicon, etc.). | 8 | LOW | novel_combination | framework | 1,486 | |
| GitHub | PrefectHQ/prefect Workflow orchestration framework for building, scheduling, and monitoring resilient data pipelines in Python | 8 | MEDIUM | incremental | framework | 22,094 | |
| GitHub | lance-format/lance Open lakehouse data format optimized for multimodal AI workloads, providing fast random access, vector indexing, and data versioning with seamless integration across data science ecosystems. | 8 | MEDIUM | novel_combination | framework | 6,282 | |
| GitHub | HKUDS/LightRAG Lightweight, fast retrieval-augmented generation (RAG) system using graph-based entity and relationship extraction for efficient document retrieval and LLM augmentation | 8 | HIGH | novel_combination | component | 32,476 | |
| GitHub | BerriAI/litellm Unified LLM API abstraction layer and gateway that normalizes requests/responses across 100+ LLM providers into OpenAI-compatible format, with cost tracking, load balancing, and guardrails. | 8 | MEDIUM | novel_combination | framework | 42,519 | |
| GitHub | genkit-ai/genkit A production-grade developer framework for building, deploying, and monitoring AI-powered applications with support for multi-language (JS/Go/Python) environments and deep Google Cloud/Firebase integration. | 8 | MEDIUM | novel_combination | framework | 5,771 | |
| GitHub | deepset-ai/haystack An end-to-end LLM orchestration framework focused on modular, production-grade pipelines for RAG, agentic workflows, and semantic search. | 8 | MEDIUM | novel_combination | framework | 24,759 | |
| GitHub | Mintplex-Labs/anything-llm All-in-one AI productivity platform combining document management, multi-model LLM support, RAG, and workspace collaboration with on-device execution and privacy-first architecture | 8 | MEDIUM | novel_combination | application | 57,858 | |
| GitHub | usnistgov/OSCAL Open standard for expressing, sharing, and validating security control definitions, assessments, and compliance documentation across organizations and tools | 8 | LOW | novel_combination | framework | 867 | |
| GitHub | airbytehq/airbyte Data integration platform providing ETL/ELT pipelines connecting APIs, databases, and files to data warehouses, lakes, and lakehouses with self-hosted and cloud deployment options. | 8 | MEDIUM | incremental | framework | 21,034 | |
| GitHub | n8n-io/n8n Visual workflow automation platform with 400+ integrations, AI capabilities, and hybrid code/no-code execution model. Self-hostable or SaaS, targeting enterprise automation use cases. | 8 | HIGH | incremental | framework | 182,937 | |
| GitHub | anchore/syft CLI tool and library for generating Software Bill of Materials (SBOM) from container images and filesystems by scanning packages and files. | 8 | LOW | novel_combination | application | 8,668 | |
| GitHub | ros-controls/ros2_control Generic control framework for ROS 2 enabling hardware abstraction, controller management, and real-time control of robotic systems | 8 | LOW | incremental | framework | 857 | |
| GitHub | modelcontextprotocol/csharp-sdk Official C# SDK for implementing Model Context Protocol (MCP) servers and clients, enabling .NET applications to interact with LLM-powered services via a standardized protocol. | 8 | HIGH | reimplementation | component | 4,174 | |
| GitHub | CycloneDX/cyclonedx-node-module Generates CycloneDX Software Bill of Materials (SBOM) for Node.js projects, identifying components, licenses, and dependencies for security and compliance tracking. | 7 | LOW | reimplementation | component | 141 | |
| arXiv | AgentRFC: Security Design Principles and Conformance Testing for Agent Protocols Security design framework and conformance testing methodology for AI agent protocols (MCP, A2A, ANP, ACP). Defines a 6-layer architectural model for agent protocol security and provides systematic testing approach. | 7 | HIGH | novel_combination | framework | 0 | |
| GitHub | CycloneDX/cyclonedx-gradle-plugin Automates the generation of CycloneDX Software Bill of Materials (SBOM) for Gradle-based projects by analyzing the dependency graph during the build process. | 7 | LOW | reimplementation | component | 219 | |
| GitHub | lobehub/lobehub Multi-agent collaboration platform and IDE for building, deploying, and orchestrating AI agent teams with plugin ecosystem and conversational UI | 7 | HIGH | novel_combination | framework | 74,891 | |
| GitHub | StarRocks/starrocks Distributed SQL query engine optimized for sub-second analytics on data lakehouses and warehouses, supporting real-time and ad-hoc analytical workloads | 7 | MEDIUM | incremental | framework | 11,552 | |
| GitHub | modelscope/ms-swift Unified fine-tuning framework for 600+ LLMs and 300+ MLLMs using PEFT/full-parameter training with support for CPT/SFT/DPO/GRPO methods | 7 | HIGH | incremental | framework | 13,587 | |
| GitHub | QMCPACK/qmcpack Production-grade quantum Monte Carlo simulation engine for ab initio electronic structure calculations with portable GPU acceleration across diverse hardware platforms | 7 | MEDIUM | incremental | component | 383 | |
| GitHub | livekit/agents Framework for building real-time voice and video AI agents with live communication capabilities | 7 | HIGH | incremental | framework | 9,963 | |
| GitHub | PennyLaneAI/pennylane Open-source quantum computing software platform enabling quantum algorithm development, quantum machine learning, and quantum chemistry simulations with hardware-agnostic abstractions across multiple quantum backends. | 7 | HIGH | novel_combination | component | 3,139 | |
| GitHub | kvcache-ai/Mooncake Production serving platform for large language models, specifically designed and deployed for Moonshot AI's Kimi LLM service at scale | 7 | HIGH | incremental | component | 5,054 | |
| GitHub | activepieces/activepieces Open-source workflow automation and AI agent orchestration platform with ~400 MCP (Model Context Protocol) server integrations for AI-driven task automation | 7 | HIGH | novel_combination | framework | 21,612 | |
| GitHub | openvinotoolkit/openvino Open-source toolkit for optimizing and deploying AI model inference across CPUs, GPUs, and edge devices with support for multiple frameworks (TensorFlow, PyTorch, ONNX) | 7 | HIGH | incremental | component | 10,029 | |
| GitHub | agentic-community/mcp-gateway-registry Enterprise-grade centralized registry and gateway for Model Context Protocol (MCP) servers, providing unified authentication, dynamic tool discovery, and auditable access control for AI agents and coding assistants. | 7 | MEDIUM | novel_combination | framework | 562 | |
| GitHub | FalkorDB/FalkorDB High-performance graph database optimized for knowledge graph workloads and LLM integration, built on sparse matrix algebra (GraphBLAS) for efficient graph traversal and pattern matching. | 7 | MEDIUM | novel_combination | component | 3,898 | |
| GitHub | CycloneDX/cyclonedx-python Official CycloneDX library and CLI for generating Software Bill of Materials (SBOM) for Python projects, supporting multiple package managers and dependency formats. | 7 | LOW | reimplementation | application | 365 | |
| GitHub | langfuse/langfuse Open-source LLM observability and engineering platform providing tracing, metrics, evals, prompt management, and experimentation capabilities for LLM applications | 7 | HIGH | novel_combination | component | 24,512 | |
| GitHub | NVIDIA/Model-Optimizer Unified model optimization library providing quantization, pruning, distillation, and speculative decoding to compress deep learning models for inference deployment on NVIDIA hardware and frameworks | 7 | HIGH | incremental | component | 2,399 | |
| GitHub | boa-dev/boa Embeddable JavaScript engine implementation in Rust, providing ECMAScript interpreter and runtime for Rust applications | 7 | MEDIUM | novel_combination | component | 7,161 | |
| GitHub | pathwaycom/pathway Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG with incremental computation and unified batch/stream semantics | 7 | MEDIUM | novel_combination | framework | 63,425 | |
| GitHub | open-compass/VLMEvalKit Comprehensive evaluation toolkit for vision-language models (VLMs/LMMs), providing standardized benchmarking across 220+ models and 80+ datasets with unified inference and scoring pipelines. | 7 | MEDIUM | novel_combination | framework | 4,010 | |
| GitHub | assistant-ui/assistant-ui React component library and typescript SDK for building conversational AI/chatbot interfaces, providing composable UI components, state management, and real-time streaming support | 7 | HIGH | incremental | framework | 9,219 | |
| GitHub | eclipse-zenoh/zenoh-plugin-ros2dds Zenoh plugin that enables ROS2 to use Zenoh as a DDS RMW (ROS Middleware), providing an alternative to standard DDS implementations with improved scalability and edge computing capabilities. | 7 | MEDIUM | novel_combination | component | 254 | |
| GitHub | crewAIInc/crewAI Framework for orchestrating multiple autonomous AI agents with role-playing and collaborative task execution | 7 | HIGH | novel_combination | framework | 48,283 | |
| GitHub | instadeepai/nucleotide-transformer Foundation models for genomic and transcriptomic sequence understanding, providing pre-trained transformers for DNA/RNA analysis and downstream task adaptation | 7 | MEDIUM | novel_combination | component | 847 | |
| GitHub | deepflowio/deepflow eBPF-based distributed tracing and profiling platform for cloud-native infrastructure observability without code instrumentation | 7 | MEDIUM | novel_combination | framework | 3,993 | |
| GitHub | Avaiga/taipy Low-code web framework for converting data/ML algorithms into production-ready applications with integrated data pipelines, scenario management, and interactive UI components | 7 | MEDIUM | novel_combination | framework | 19,156 | |
| GitHub | open-webui/open-webui Web-based conversational UI for interfacing with multiple LLM backends (Ollama, OpenAI, etc.) with chat history, RAG, and agent capabilities | 7 | MEDIUM | incremental | application | 130,527 | |
| GitHub | apache/incubator-xtable Cross-table format converter enabling interoperability between lakehouse table formats (Iceberg, Delta Lake, Hudi) for query engines and data processing systems | 7 | MEDIUM | novel_combination | framework | 1,178 | |
| GitHub | Skyvern-AI/skyvern AI-driven browser automation and workflow orchestration platform that uses vision and language models to automatically execute complex web-based tasks without explicit scripting | 7 | HIGH | novel_combination | component | 21,082 | |
| GitHub | NVIDIA-NeMo/DataDesigner Generate high-quality synthetic data from scratch or seed data for training machine learning models, with focus on domain-specific data synthesis and quality control. | 7 | HIGH | novel_combination | framework | 1,488 | |
| GitHub | dylibso/chicory JVM-native WebAssembly runtime enabling execution of WASM binaries on the Java Virtual Machine without external dependencies | 7 | MEDIUM | novel_combination | library | 1,047 | |
| GitHub | lakesoul-io/LakeSoul Cloud-native lakehouse framework enabling real-time data ingestion, concurrent ACID updates, and incremental analytics on object storage with BI/AI support | 7 | MEDIUM | novel_combination | framework | 3,229 | |
| GitHub | chainloop-dev/chainloop SDLC evidence store and policy engine for software supply chain attestations, SBOMs, VEX, SARIF, and QA reports with centralized governance and compliance verification | 7 | MEDIUM | novel_combination | framework | 542 | |
| GitHub | TouK/nussknacker Low-code visual workflow engine for real-time stream processing and data automation; enables non-technical users to design, deploy, and monitor streaming data pipelines without writing code | 7 | MEDIUM | novel_combination | framework | 716 | |
| GitHub | eclipse-zenoh/zenoh A unified pub/sub and data fabric platform that integrates messaging, distributed storage, queries, and computation with geo-distributed capabilities and high efficiency for edge-to-cloud deployments. | 7 | MEDIUM | novel_combination | framework | 2,611 | |
| GitHub | hazelcast/hazelcast Unified real-time data platform: distributed in-memory data store with integrated stream processing and event-driven architecture | 7 | MEDIUM | incremental | framework | 6,610 | |
| GitHub | confluentinc/ksql KSQL: SQL database engine purpose-built for stream processing and real-time analytics on Apache Kafka | 7 | MEDIUM | novel_combination | framework | 294 | |
| GitHub | nasa/ogma Automated runtime monitor generation for safety-critical aerospace and robotics systems from formal specifications | 7 | MEDIUM | novel_combination | component | 553 | |
| GitHub | trailbaseio/trailbase Open-source, self-hosted Firebase alternative with sub-millisecond latency, type-safe APIs, WebAssembly runtime, real-time subscriptions, and built-in auth/admin UI | 7 | MEDIUM | novel_combination | framework | 4,737 | |
| GitHub | Profluent-AI/OpenCRISPR AI-generated gene editing systems using machine learning to design novel CRISPR variants and improve editing efficiency | 7 | MEDIUM | novel_combination | framework | 1,182 | |
| GitHub | SCHUNK-SE-Co-KG/schunk_svh_ros_driver Official ROS1 and ROS2 hardware driver for the Schunk SVH (Schunk Five-finger Hand), providing low-level motor control and sensor feedback for dexterous robotic manipulation. | 7 | LOW | reimplementation | component | 19 | |
| GitHub | CycloneDX/cyclonedx-dotnet Generates CycloneDX-compliant Software Bill of Materials (SBOM) for .NET projects and solutions by analyzing NuGet dependencies. | 7 | LOW | reimplementation | application | 262 | |
| GitHub | pixeltable/pixeltable Data infrastructure for multimodal AI workloads with declarative, incremental computation and versioning | 7 | MEDIUM | novel_combination | framework | 1,624 | |
| GitHub | katanemo/plano AI-native proxy and data plane for agentic applications with orchestration, safety, observability, and LLM routing capabilities | 7 | HIGH | novel_combination | framework | 6,228 | |
| GitHub | ytsaurus/ytsaurus Distributed data processing and storage platform for petabyte-scale analytics with fault tolerance, scheduling, and multi-tenancy | 7 | LOW | reimplementation | framework | 2,151 | |
| GitHub | apache/gravitino Open-source federated metadata catalog system for managing and governing multi-source data assets across distributed environments with unified metadata discovery, lineage tracking, and access control. | 7 | MEDIUM | novel_combination | framework | 2,944 | |
| GitHub | dbt-labs/dbt-mcp MCP (Model Context Protocol) server enabling LLMs and AI assistants to interact with dbt projects, query lineage, execute commands, and retrieve metadata | 7 | HIGH | novel_combination | component | 529 | |
| GitHub | databendlabs/databend Cloud-native data warehouse with unified architecture for analytics, search, and AI workloads, built on S3 with Python sandbox capabilities | 7 | MEDIUM | novel_combination | framework | 9,236 | |
| GitHub | mongodb-js/mongodb-mcp-server Model Context Protocol (MCP) server enabling LLM agents to connect to, query, and interact with MongoDB databases and Atlas clusters through a standardized interface | 7 | HIGH | novel_combination | component | 995 | |
| GitHub | sdv-dev/SDV Generate synthetic tabular data that preserves statistical properties and relationships while protecting privacy | 7 | MEDIUM | novel_combination | component | 3,463 | |
| GitHub | alibaba/MNN High-performance neural network inference engine optimized for on-device and edge deployment, with quantization, model compression, and multi-platform support (mobile, IoT, cloud). | 7 | MEDIUM | incremental | component | 14,818 | |
| GitHub | Tencent/AI-Infra-Guard Full-stack AI red teaming and security scanning platform for AI ecosystems, covering LLM jailbreaks, agent vulnerabilities, skill exploits, MCP protocol flaws, and infrastructure weaknesses | 7 | MEDIUM | novel_combination | application | 3,410 | |
| GitHub | bentoml/BentoML Framework for serving machine learning models and AI applications as APIs, with support for batching, multi-model pipelines, async job queues, and LLM app deployment. | 7 | MEDIUM | incremental | framework | 8,563 | |
| GitHub | oracle/macaron Extensible supply-chain security analysis framework for detecting malicious packages, validating build systems, and enforcing SLSA compliance across CI/CD pipelines | 7 | MEDIUM | novel_combination | framework | 190 | |
| GitHub | YZY-stack/DF40 Comprehensive deepfake detection dataset and benchmark covering 40 distinct deepfake generation techniques, including state-of-the-art methods, with evaluation framework for detector generalization. | 7 | MEDIUM | novel_combination | framework | 331 | |
| GitHub | onyx-dot-app/onyx Open-source enterprise AI chat platform with multi-LLM support, RAG capabilities, and knowledge base integration | 7 | HIGH | incremental | application | 25,906 | |
| GitHub | tqec/tqec Design automation and simulation framework for topological quantum error correction (TQEC) codes, enabling researchers to model and optimize fault-tolerant quantum computing architectures | 7 | MEDIUM | novel_combination | framework | 348 | |
| arXiv | A plug-and-play superconducting quantum controller at millikelvin temperatures enables exceeding 99.9% average gate fidelity Superconducting quantum controller hardware enabling high-fidelity qubit gate operations (>99.9%) at millikelvin temperatures through direct chip-to-chip interconnection and all-digital control | 7 | HIGH | novel_combination | component | 0 | |
| GitHub | vercel-labs/agent-browser Browser automation CLI tool enabling AI agents to programmatically control web browsers for task execution and web interaction | 7 | HIGH | incremental | component | 27,837 | |
| GitHub | lava-nc/lava Software framework for building, simulating, and deploying neuromorphic computing applications using spiking neural networks (SNNs) and event-driven architectures | 7 | MEDIUM | novel_combination | framework | 711 | |
| GitHub | langfuse/langfuse-python Python SDK for LLM application instrumentation and tracing, providing observability through decorators and low-level APIs | 7 | MEDIUM | novel_combination | framework | 377 | |
| GitHub | NVIDIA/cudaqx Accelerated quantum-classical computing libraries built on NVIDIA's CUDA-Q framework, providing GPU-optimized subroutines for hybrid quantum algorithms and quantum circuit simulation. | 7 | HIGH | incremental | component | 90 | |
| GitHub | Gaius-Augustus/BRAKER Automated pipeline for predicting protein-coding gene structures in novel eukaryotic genomes using GeneMark and AUGUSTUS | 7 | LOW | incremental | application | 451 | |
| GitHub | unslothai/unsloth Web UI and optimization framework for fine-tuning and running open-source LLMs locally with reduced memory/compute requirements | 7 | HIGH | novel_combination | component | 60,068 | |
| GitHub | screenpipe/screenpipe Continuous screen and audio capture with local AI indexing to enable agents that understand user context and automate tasks based on real-time activity | 7 | HIGH | novel_combination | framework | 18,070 | |
| GitHub | CopilotKit/CopilotKit Frontend framework and SDK for building agentic UI with LLM integration, providing React/Angular components, state management, and agent orchestration for generative applications | 7 | HIGH | novel_combination | framework | 30,063 | |
| GitHub | CycloneDX/cyclonedx-gomod Official CycloneDX tool for generating Software Bill of Materials (SBOMs) from Go modules, providing visibility into software supply chains. | 7 | LOW | reimplementation | component | 179 | |
| GitHub | meta-pytorch/MSLK PyTorch GPU operator library optimized for GenAI training/inference with FP8 quantization and collective communications primitives | 6 | HIGH | incremental | component | 94 | |
| GitHub | python-streamz/streamz Real-time stream processing library for Python with support for reactive programming patterns, lazy evaluation, and distributed computation | 6 | MEDIUM | incremental | framework | 1,295 | |
| GitHub | tiiuae/sbomnix Generate and analyze Software Bill of Materials (SBOM) for Nix-based software supply chains, with vulnerability scanning and dependency analysis | 6 | LOW | novel_combination | application | 252 | |
| GitHub | latticesurgery-com/lattice-surgery-compiler Quantum error correction compiler implementing lattice surgery surface code techniques for fault-tolerant quantum computing | 6 | MEDIUM | novel_combination | component | 84 | |
| GitHub | LearningCircuit/local-deep-research Local multi-source deep research pipeline with 95% SimpleQA benchmark performance; searches 10+ sources (arXiv, PubMed, web, private docs) with support for local and cloud LLMs, privacy-first architecture | 6 | HIGH | novel_combination | application | 4,280 | |
| GitHub | qiboteam/qibo Full-stack quantum computing framework providing high-level abstractions for circuit definition, simulation, and execution on quantum hardware backends | 6 | HIGH | incremental | framework | 348 | |
| GitHub | quic/aimet Advanced quantization and compression techniques for neural network models with post-training and training-aware optimization | 6 | HIGH | incremental | component | 2,587 | |
| GitHub | vxcontrol/pentagi Fully autonomous AI agents system for penetration testing and security assessment automation | 6 | HIGH | novel_combination | framework | 14,476 | |
| GitHub | gptme/gptme Terminal-based AI agent that writes code, executes shell commands, and browses the web with persistent state and tool composition | 6 | HIGH | novel_combination | framework | 4,263 | |
| arXiv | Latent-Y: A Lab-Validated Autonomous Agent for De Novo Drug Design Autonomous AI agent for end-to-end antibody design and drug discovery, executing literature review through computational validation and candidate selection via multi-stage workflow orchestration | 6 | HIGH | novel_combination | application | 0 | |
| GitHub | Ekumen-OS/beluga Production-grade C++17 implementation of Monte Carlo Localization (MCL) algorithms with ROS 1/2 integration for robot pose estimation | 6 | MEDIUM | incremental | component | 315 | |
| GitHub | GenerTeam/GENERanno Genomic foundation model for automated metagenomic sequence annotation and functional classification | 6 | MEDIUM | novel_combination | component | 309 | |
| GitHub | agent0ai/agent-zero Autonomous AI agent framework enabling multi-step reasoning, tool use, and file/code manipulation with local LLM support and extensible architecture | 6 | HIGH | incremental | framework | 16,830 | |
| GitHub | gpustack/gpustack GPU cluster manager for orchestrating distributed inference engines (vLLM, SGLang) with automatic configuration, scheduling, and resource management across heterogeneous GPU hardware. | 6 | HIGH | incremental | application | 4,795 | |
| GitHub | datacommonsorg/agent-toolkit Agent toolkit for querying and interacting with the Data Commons Knowledge Graph via Model Context Protocol (MCP) servers | 6 | MEDIUM | novel_combination | framework | 138 | |
| arXiv | Triangle Multiplication Is All You Need For Biomolecular Structure Representations Pairmixer: A computationally efficient alternative to AlphaFold3's Pairformer backbone for biomolecular structure prediction, replacing expensive triangle attention with streamlined triangular primitives to enable large-scale protein folding applications. | 6 | HIGH | novel_combination | component | 0 | |
| GitHub | infiniflow/ragflow Open-source RAG (Retrieval-Augmented Generation) engine with agent capabilities, providing a context layer for LLMs with document ingestion, chunking, vector search, and agentic orchestration. | 6 | HIGH | incremental | component | 77,400 | |
| GitHub | intel/auto-round Automated quantization of large language models to low-bit precision using rounding-based algorithms, optimized for CPU/XPU/CUDA inference with broad framework compatibility | 6 | HIGH | novel_combination | component | 944 | |
| GitHub | NVIDIA-AI-Blueprints/rag Reference implementation of a Retrieval Augmented Generation (RAG) pipeline with NVIDIA optimizations for vector search, embedding, and LLM inference. | 6 | HIGH | reimplementation | framework | 550 | |
| GitHub | openlit/openlit OpenTelemetry-native observability platform for LLM applications, providing integrated monitoring, guardrails, evaluations, prompt management, and multi-provider instrumentation. | 6 | HIGH | incremental | framework | 2,349 | |
| GitHub | containers/ramalama Container-native framework for local AI model serving and inference management with OCI-compliant abstraction across heterogeneous model sources | 6 | MEDIUM | novel_combination | application | 2,693 | |
| GitHub | quantumlib/chromobius Möbius decoder implementation for color codes in quantum error correction | 6 | MEDIUM | novel_combination | component | 30 | |
| GitHub | CherryHQ/cherry-studio Desktop AI productivity studio providing unified chat interface, autonomous agents, and 300+ pre-built assistants with multi-LLM support (OpenAI, Claude, Gemini, local models) | 6 | HIGH | incremental | application | 43,123 | |
| GitHub | microsoft/mcp-for-beginners Educational curriculum and reference implementation for Model Context Protocol (MCP) across multiple programming languages | 6 | HIGH | reimplementation | framework | 15,824 | |
| GitHub | Open-Source-Legal/OpenContracts Self-hosted document annotation, semantic search, and knowledge base construction platform with support for human-AI collaboration and MCP integration | 6 | MEDIUM | novel_combination | application | 1,264 | |
| GitHub | f/prompts.chat Community-driven prompt repository and discovery platform for sharing, organizing, and self-hosting ChatGPT/LLM prompts with privacy controls | 6 | HIGH | incremental | application | 157,876 | |
| GitHub | rdk/p2rank Machine learning-based prediction of protein-ligand binding sites from 3D protein structures using structural features and trained classifiers | 6 | MEDIUM | novel_combination | application | 410 | |
| GitHub | Agent-Field/agentfield Framework for deploying AI agents as scalable microservices with built-in observability, identity management, and multi-tenant isolation. | 6 | HIGH | novel_combination | framework | 1,370 | |
| GitHub | embabel/embabel-agent JVM-based agent framework for building autonomous systems with multi-agent orchestration, message passing, and distributed task execution | 6 | MEDIUM | incremental | framework | 3,258 | |
| GitHub | APPFL/APPFL Privacy-preserving federated learning framework with differential privacy and secure aggregation for distributed machine learning | 6 | MEDIUM | novel_combination | framework | 174 | |
| GitHub | datachain-ai/datachain Analytics, versioning, and ETL framework for multimodal data (video, audio, PDFs, images) with built-in data lineage and versioning capabilities | 6 | MEDIUM | novel_combination | component | 2,736 | |
| GitHub | PennyLaneAI/pennylane-qiskit Integration plugin bridging PennyLane quantum machine learning framework with IBM Qiskit quantum computing backend and IBM Q hardware | 6 | MEDIUM | derivative | component | 230 | |
| GitHub | genomoncology/biomcp Model Context Protocol (MCP) server implementation for biomedical data access and LLM integration, enabling AI models to query genomic and clinical databases through standardized interfaces | 6 | MEDIUM | novel_combination | framework | 483 | |
| GitHub | dstackai/dstack Control plane for provisioning and orchestrating GPU compute across heterogeneous infrastructure (cloud, Kubernetes, bare-metal) and diverse accelerators (NVIDIA, AMD, TPU, Tenstorrent) | 6 | HIGH | incremental | framework | 2,083 | |
| GitHub | MyersResearchGroup/iBioSim Computer-aided design (CAD) tool for modeling, analysis, and design of genetic circuits with SBML/SBOL support | 6 | LOW | incremental | application | 65 | |
| GitHub | qualcomm/ai-hub-models Collection of pre-optimized ML models targeting Qualcomm device deployment with performance optimization for latency and memory constraints | 6 | MEDIUM | reimplementation | framework | 977 | |
| GitHub | moltis-org/moltis Secure, self-hosted personal agent server supporting multi-provider LLMs, voice I/O, and integrations with messaging platforms (Telegram, WhatsApp, Discord, Teams) plus MCP tools, with sandboxed execution and persistent memory. | 6 | HIGH | novel_combination | application | 2,517 | |
| GitHub | CoplayDev/unity-mcp Model Context Protocol (MCP) server that bridges AI assistants (Claude, Cursor) with Unity Editor, enabling LLM-driven automation of asset management, scene control, script editing, and task orchestration. | 6 | HIGH | novel_combination | component | 8,156 | |
| arXiv | Accurate RNA 3D structure prediction using a language model-based deep learning approach Predict 3D RNA structures from sequence using language model-based deep learning | 6 | HIGH | novel_combination | algorithm | 0 | |
| GitHub | polyuiislab/infiAgent Configuration-driven agent framework for building long-horizon autonomous agents with multi-turn reasoning, skill composition, and complex task orchestration without code modifications. | 6 | HIGH | novel_combination | framework | 1,156 | |
| GitHub | autowarefoundation/agnocast Zero-copy inter-process communication (IPC) middleware for ROS 2, enabling efficient message passing between processes using shared memory without serialization overhead. | 6 | MEDIUM | novel_combination | component | 179 | |
| GitHub | github/CopilotForXcode AI-powered code completion and assistance plugin for Apple's Xcode IDE | 6 | HIGH | reimplementation | application | 5,963 | |
| GitHub | GenerTeam/GENERator Long-context generative foundation model for genomic sequence generation and analysis, trained on large-scale DNA/RNA datasets | 6 | HIGH | novel_combination | framework | 449 | |
| arXiv | GENERator: A Long-Context Generative Genomic Foundation Model Long-context generative foundation model for DNA sequence modeling and genomic function interpretation, supporting 98k nucleotide context windows | 6 | HIGH | novel_combination | framework | 0 | |
| GitHub | campfirein/byterover-cli Portable memory abstraction layer for autonomous coding agents, enabling context persistence and retrieval across agent sessions | 6 | MEDIUM | novel_combination | component | 4,361 | |
| GitHub | MarcToussaint/robotic Python robotics control and manipulation planning library with motion planning, collision checking, and trajectory optimization for physical and simulated robots | 6 | MEDIUM | novel_combination | framework | 137 | |
| arXiv | Gene42: Long-Range Genomic Foundation Model With Dense Attention Long-context genomic foundation model enabling dense attention over 192,000 base pair sequences for DNA/RNA analysis and generation | 6 | HIGH | novel_combination | framework | 0 | |
| arXiv | CONFIDE: Hallucination Assessment for Reliable Biomolecular Structure Prediction and Design Hallucination detection and confidence assessment for protein structure predictions by analyzing topological frustration via diffusion embeddings from AlphaFold3 | 6 | HIGH | novel_combination | algorithm | 0 | |
| GitHub | InternLM/xtuner Fine-tuning and training engine for large language models (LLMs) and mixture-of-experts (MoE) models, with support for low-rank adaptation (LoRA), quantization, and distributed training | 6 | HIGH | incremental | component | 5,117 | |
| GitHub | Softeria/ms-365-mcp-server MCP server enabling LLM agents to interact with Microsoft 365 services (Teams, OneDrive, SharePoint, Exchange, Outlook) via the Microsoft Graph API | 6 | HIGH | incremental | component | 596 | |
| GitHub | Kiln-AI/Kiln Integrated platform for building, evaluating, and optimizing AI systems with support for evals, RAG, agents, fine-tuning, synthetic data generation, and dataset management. | 6 | HIGH | incremental | framework | 4,741 | |
| arXiv | Tsim: Fast Universal Simulator for Quantum Error Correction High-throughput GPU-accelerated simulator for noisy quantum circuits with native quantum error correction support, using ZX diagram representation and Pauli channel modeling. | 6 | HIGH | novel_combination | component | 0 | |
| arXiv | Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions Large-scale dataset of 2,000+ individuals with 500+ question responses for training and validating LLM-based digital twin models that simulate human behavior | 6 | MEDIUM | novel_combination | component | 0 | |
| GitHub | NeptuneHub/AudioMuse-AI Local audio analysis and automatic playlist generation for self-hosted music servers (Jellyfin, Navidrome, LMS, Emby) using ONNX-based sonic feature extraction and ML-driven curation | 6 | LOW | novel_combination | application | 1,526 | |
| GitHub | nf-core/crisprseq Nextflow pipeline for CRISPR gene editing analysis, supporting targeted NGS quality assessment and pooled CRISPR screening data analysis | 6 | LOW | incremental | framework | 57 | |
| GitHub | OpenBMB/UltraRAG Low-code framework for constructing modular RAG (Retrieval-Augmented Generation) pipelines with multi-stage orchestration, entity extraction, and flexible data flow composition | 6 | MEDIUM | novel_combination | framework | 5,475 | |
| GitHub | nottelabs/notte Framework for building web agents and deploying serverless web automation functions with managed browser infrastructure | 6 | MEDIUM | novel_combination | framework | 1,929 | |
| GitHub | getsentry/XcodeBuildMCP MCP server and CLI tooling for iOS/macOS development automation and agent-assisted Xcode project workflows | 6 | MEDIUM | novel_combination | framework | 5,074 | |
| GitHub | JacopoPan/aerial-autonomy-stack End-to-end framework for simulating and deploying autonomous drone swarms with perception (YOLO, LiDAR) on PX4/ArduPilot autopilots, integrated with ROS2 and NVIDIA Jetson hardware | 6 | MEDIUM | novel_combination | framework | 374 | |
| arXiv | Path-Constrained Mixture-of-Experts Novel theoretical framework for understanding and optimizing Mixture-of-Experts (MoE) architectures by modeling token routing as constrained expert paths that align with linguistic function, enabling sparse computation and improved efficiency. | 6 | MEDIUM | novel_combination | algorithm | 0 | |
| GitHub | XmirrorSecurity/OpenSCA-cli Open source software supply chain security scanner for dependency detection, vulnerability identification, and license compliance analysis | 6 | MEDIUM | incremental | application | 1,134 | |
| GitHub | kac89/vulnrepo End-to-end encrypted vulnerability report generator and management system with multi-source import (Nmap, Nessus, Burp, OpenVAS, Bugcrowd, Trivy) and templating for compliance frameworks (CWE, CVE, MITRE ATT&CK, PCI DSS) | 6 | MEDIUM | incremental | application | 551 | |
| GitHub | GopherSecurity/gopher-mcp C++ SDK implementation of the Model Context Protocol (MCP) with enterprise security, observability, and connectivity features for building secure AI agent integrations | 6 | MEDIUM | incremental | component | 102 | |
| arXiv | Sim2Field: End-to-End Development of AI RANs for 6G End-to-end framework for developing and validating AI/ML models for cellular Radio Access Networks (RAN) with bridging of the simulation-to-field reality gap for 5G/6G deployment | 6 | HIGH | novel_combination | framework | 0 | |
| GitHub | stacklok/toolhive Enterprise platform for running, managing, and orchestrating Model Context Protocol (MCP) servers with deployment, monitoring, and integration capabilities | 6 | HIGH | novel_combination | framework | 1,703 | |
| GitHub | dream-num/univer Full-stack spreadsheet framework with AI-native capabilities, enabling collaborative spreadsheet creation/editing across web and server with natural language interfaces | 6 | HIGH | novel_combination | framework | 12,729 | |
| GitHub | janhq/jan Local-first, privacy-preserving LLM inference platform providing offline ChatGPT-like capabilities on consumer hardware | 6 | HIGH | reimplementation | application | 41,646 | |
| GitHub | CloudDetail/apo Comprehensive observability platform combining OpenTelemetry with eBPF for automated system monitoring, tracing, and LLM-powered troubleshooting | 6 | MEDIUM | novel_combination | framework | 377 | |
| GitHub | mavdol/capsule Secure sandbox runtime for executing untrusted AI agent code using WebAssembly with isolation guarantees | 6 | MEDIUM | novel_combination | component | 276 | |
| arXiv | MATTERIX: toward a digital twin for robotics-assisted chemistry laboratory automation GPU-accelerated multiscale robotic simulation framework for creating digital twins of chemistry laboratories to accelerate materials discovery workflows and reduce physical experimental iterations | 6 | HIGH | novel_combination | framework | 0 | |
| GitHub | MervinPraison/PraisonAI Low-code multi-agent AI orchestration framework for automating complex workflows with autonomous agents that integrate with messaging platforms (Telegram, Discord, WhatsApp) and support 100+ LLMs. | 6 | HIGH | incremental | framework | 6,813 | |
| GitHub | MemMachine/MemMachine Universal memory layer providing scalable, extensible storage and retrieval infrastructure for AI agent state management and reasoning | 6 | HIGH | novel_combination | framework | 5,360 | |
| GitHub | helicalAI/helical Framework for leveraging pre-trained foundation models on genomic and transcriptomic data, providing unified access to state-of-the-art models for biological sequence analysis and embedding generation. | 6 | MEDIUM | novel_combination | framework | 197 | |
| GitHub | ICube-Robotics/iiwa_ros2 ROS2 integration stack for KUKA iiwa collaborative robotic arms, providing hardware drivers, control interfaces, and simulation support | 6 | LOW | incremental | framework | 127 | |
| GitHub | langfuse/langfuse-js JavaScript/TypeScript SDK for LLM application instrumentation, tracing, and observability across any LLM provider or framework | 6 | MEDIUM | incremental | library | 128 | |
| arXiv | A Mixture of Experts Foundation Model for Scanning Electron Microscopy Image Analysis Foundation model for Scanning Electron Microscopy image analysis using Mixture of Experts architecture, enabling multi-task transfer learning across diverse SEM imaging conditions and instruments | 6 | MEDIUM | novel_combination | framework | 0 | |
| arXiv | RDT2: Exploring the Scaling Limit of UMI Data Towards Zero-Shot Cross-Embodiment Generalization Vision-Language-Action foundation model for zero-shot cross-embodiment robotic generalization trained on 10,000+ hours of diverse demonstrations | 6 | HIGH | novel_combination | framework | 0 | |
| GitHub | Peergos/Peergos Decentralized peer-to-peer file storage, synchronization, and social networking system with end-to-end encryption and application protocol layer | 6 | LOW | novel_combination | framework | 2,391 | |
| arXiv | Beyond Conditional Computation: Retrieval-Augmented Genomic Foundation Models with Gengram Retrieval-augmented genomic foundation model module that uses genomic-specific hashing to efficiently retrieve multi-base motifs, reducing computational overhead while improving accuracy on functional genomics tasks | 6 | MEDIUM | novel_combination | component | 0 | |
| GitHub | beelzebub-labs/beelzebub A high-interaction honeypot framework that uses LLMs and Lua scripting to simulate realistic system environments for threat intelligence and attacker analysis. | 6 | LOW | novel_combination | framework | 1,941 | |
| GitHub | mastra-ai/mastra TypeScript framework for building AI-powered applications and agents with orchestration, memory, tool integration, and LLM connectivity | 6 | HIGH | incremental | framework | 22,784 | |
| GitHub | ModelTC/LightLLM High-performance LLM inference and serving framework with lightweight, scalable architecture for deploying large language models | 6 | HIGH | incremental | framework | 3,996 | |
| GitHub | dreadnode/AIRTBench-Code Benchmark suite for evaluating autonomous AI red-teaming capabilities in language models, measuring safety vulnerabilities and adversarial robustness | 6 | MEDIUM | novel_combination | framework | 97 | |
| GitHub | Classiq/classiq-library Curated library and reference implementation collection for quantum algorithms and applications, designed for exploration and learning in quantum computing | 6 | MEDIUM | reimplementation | component | 1,986 | |
| GitHub | apple/pfl-research Simulation framework for accelerating research in Private Federated Learning (PFL), enabling researchers to prototype and benchmark federated learning algorithms with differential privacy constraints | 6 | MEDIUM | novel_combination | framework | 353 | |
| GitHub | relizaio/rearm Release-level software supply chain evidence platform for storing, versioning, and auditing SBOMs, xBOMs, and software artifacts with 10+ year retention and compliance-ready audit trails | 6 | MEDIUM | novel_combination | application | 111 | |
| GitHub | PaddlePaddle/FastDeploy High-performance inference and deployment toolkit optimized for large language models (LLMs) and vision-language models (VLMs), providing multi-backend support, quantization, and cross-platform deployment | 6 | HIGH | incremental | framework | 3,671 | |
| GitHub | SolaceLabs/solace-agent-mesh Event-driven framework for orchestrating multi-agent AI systems with real-world data integration and complex workflow execution | 6 | HIGH | novel_combination | framework | 2,980 | |
| GitHub | PECOS-packages/PECOS Software framework for designing, simulating, and evaluating quantum error-correction (QEC) protocols with support for multiple code families and noise models | 5 | MEDIUM | incremental | framework | 48 | |
| GitHub | bytecodealliance/wasmtime-rb Ruby bindings for Wasmtime WebAssembly runtime, enabling execution of WASM modules from Ruby applications | 5 | LOW | derivative | component | 137 |