Overhaul model system with multi-stage training, variants, benchmarks, and eval
CI / build-and-push (push) Successful in 32s

Replace the single-stage training + flat capability score with a realistic AI
development pipeline: pre-training with Chinchilla scaling laws, SFT with
specializations, alignment with safety/capability tradeoffs (RLHF/DPO/Constitutional),
model families with distillation/fine-tuning/quantization variants, named benchmark
suite with compute-costing eval jobs, and segment-specific market quality.

Phases 1-6 of the model rework plan: new types, engine rewrite, save migration,
training events/risk system, concurrent training, variant creation, benchmark
evaluation with leaderboard, and market integration.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-04-25 07:36:34 -04:00
parent fc1f371c8c
commit 4c1c0e9ff2
24 changed files with 2157 additions and 357 deletions
@@ -107,9 +107,9 @@ export function StateInspectionTab() {
<Stat label="Completed" value={research.completedResearch.length} />
<Stat label="Points" value={research.researchPoints.toFixed(1)} />
<Stat label="Active" value={research.activeResearch?.researchId ?? 'None'} />
<Stat label="Models" value={models.trainedModels.length} />
<Stat label="Training" value={models.activeTraining?.modelName ?? 'None'} />
<Stat label="Deployed" value={models.trainedModels.filter(m => m.isDeployed).length} />
<Stat label="Models" value={models.baseModels.length} />
<Stat label="Training" value={models.activeTrainingPipelines.filter(p => p.status === 'active').length} />
<Stat label="Deployed" value={models.baseModels.filter(m => m.isDeployed).length} />
</Section>
</div>
);