A 1B-parameter diffusion-based small language model trained on FineWeb, using the LLaDA framework for bidirectional context and denoising-based text generation.
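To make the "denoising-based text generation" concrete, here is a minimal, hedged sketch of LLaDA-style masked-diffusion sampling: the sequence starts fully masked, and over a fixed number of steps the model's highest-confidence predictions are committed while the rest are remasked. The `toy_denoiser` is a hypothetical stand-in for the actual 1B-parameter model; names and the remasking schedule are illustrative assumptions, not this project's implementation.

```python
import random

MASK = -1  # sentinel for a masked token position

def toy_denoiser(tokens, vocab_size=10):
    # Hypothetical stand-in for the trained model: for every position,
    # return a (token, confidence) proposal. Known tokens keep
    # confidence 1.0; masked positions get a random guess here.
    out = []
    for t in tokens:
        if t == MASK:
            out.append((random.randrange(vocab_size), random.random()))
        else:
            out.append((t, 1.0))
    return out

def llada_style_sample(length=8, steps=4, seed=0):
    """Masked-diffusion generation sketch: start fully masked, then at
    each denoising step commit the highest-confidence predictions and
    remask the remainder (low-confidence remasking)."""
    random.seed(seed)
    seq = [MASK] * length
    for step in range(steps, 0, -1):
        preds = toy_denoiser(seq)
        # Linear schedule: how many positions may stay masked after this step.
        keep_masked = length * (step - 1) // steps
        # Rank still-masked positions by confidence; commit the best ones.
        masked = [(conf, i, tok) for i, (tok, conf) in enumerate(preds)
                  if seq[i] == MASK]
        masked.sort(reverse=True)
        for conf, i, tok in masked[:len(masked) - keep_masked]:
            seq[i] = tok
    return seq
```

Because every position is predicted jointly at each step, the denoiser sees bidirectional context, in contrast to the left-to-right factorization of autoregressive models.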
Stars: 0 | Forks: 0
This is a very early-stage project (7 days old, 0 stars) attempting to replicate or apply the LLaDA (Large Language Diffusion with mAsking) architecture. Diffusion-based text generation is an interesting niche alongside standard autoregressive models, but this project shows none of the community traction, unique dataset, or architectural breakthrough that would justify a higher defensibility score. Frontier labs are actively researching discrete diffusion for LLMs, so the risk of obsolescence is high.
TECH STACK
INTEGRATION: library_import
READINESS: