Skip to content

Honest provenance: family-wide non-media paper rewrite/withdraw (~45 rewritten, 11 withdrawn)#4

Merged
hanzo-dev merged 1 commit into
mainfrom
paper/honest-family-provenance
Jun 17, 2026
Merged

Honest provenance: family-wide non-media paper rewrite/withdraw (~45 rewritten, 11 withdrawn)#4
hanzo-dev merged 1 commit into
mainfrom
paper/honest-family-provenance

Conversation

@hanzo-dev

Copy link
Copy Markdown
Contributor

Companion to #3 (media papers). Extends the honesty fix to the 65 non-media papers: the corpus fabricated a homegrown "Zen MoDE (Mixture of Distilled Experts)" architecture + inflated benchmarks for models that are off-the-shelf forks (verified by HF config fingerprinting).

Rewritten (~45) — attribute real upstream + license, drop "Zen MoDE" + fabricated benchmarks

  • Flagship LLMs (zen-base/pro/family-overview/3-nano) → Qwen3-8B (false "72B"/"0.6B" corrected)
  • Multimodal (zen3-omni/3-vl/vl) → Qwen3-Omni / Qwen3-VL (false "72B" → real ~31B MoE)
  • Retrieval/safety: zen3-embedding/reranker → Qwen3-Embedding/Reranker; zen3-guardIBM Granite Guardian; zen-guard-gen → Qwen2.5-7B; zen-guard-stream → Qwen2.5-3B ⚠️
  • Speech/dub: zen-scribe → Qwen3-ASR; zen-voice-clone → Qwen3-TTS; zen-dub → Qwen3-TTS+MuseTalk; zen-dub-live/zen-live → Qwen3-Omni
  • Coder: zen-coder → Qwen-Coder (deleted fabricated "Zen Agentic Dataset"); zen5 → DeepSeek-V4-Flash
  • Domain (legal/financial/medical) → Qwen3-8B finetunes
  • zen-designer-* → Qwen3-VL-235B-A22B (verified real repos)
  • Method/architecture → stripped "Zen MoDE" + phantom 480B/1T; techniques kept; result tables marked illustrative

Withdrawn (11) — no backing model (vaporware / fabricated frontier scale)

zen-max (480B), the entire zen4-* generation (zen4/-mini/-pro/-max/-ultra/-thinking/-coder/-coder-flash/-coder-pro), zen-reasoning.

INDEX.md (→ 59 papers) + PAPER_TIMELINE.md updated. All rewritten papers compile.

⚠️ Surfaced concern (model repo, not this PR)

zenlm/zen-guard-stream is built on Qwen2.5-3B = non-commercial Qwen Research License (not Apache). The model repo's license tag likely needs correction, like the media cleanup.

🤖 Generated with Claude Code

The non-media zen paper corpus fabricated a homegrown "Zen MoDE (Mixture of
Distilled Experts)" architecture + inflated benchmarks for what are off-the-shelf
forks. Rewrite/withdraw all 65 to be honest (companion to the media PR #3).

Rewritten (~45) to attribute the REAL upstream + license, drop "Zen MoDE" +
fabricated benchmarks:
- Flagship LLMs (zen-base/pro/family-overview/3-nano) -> Qwen3-8B (false 72B/0.6B corrected)
- Multimodal (zen3-omni/3-vl/vl) -> Qwen3-Omni / Qwen3-VL (false 72B -> ~31B MoE)
- Retrieval/safety: zen3-embedding/reranker -> Qwen3-Embedding/Reranker;
  zen3-guard -> IBM Granite Guardian; zen-guard-gen -> Qwen2.5-7B;
  zen-guard-stream -> Qwen2.5-3B (Qwen Research License = NON-commercial; flagged)
- Speech/dub: zen-scribe -> Qwen3-ASR; zen-voice-clone -> Qwen3-TTS;
  zen-dub -> Qwen3-TTS + MuseTalk; zen-dub-live/zen-live -> Qwen3-Omni
- Coder: zen-coder -> Qwen-Coder (deleted fake "Zen Agentic Dataset"); zen5 -> DeepSeek-V4-Flash
- Domain (legal/financial/medical) -> Qwen3-8B finetunes (fabricated benchmarks dropped)
- zen-designer-instruct/thinking -> Qwen3-VL-235B-A22B (verified real repos)
- Method/arch papers -> stripped "Zen MoDE" + phantom 480B/1T; techniques kept; tables marked illustrative

Withdrawn (11; no backing model / fabricated frontier scale): zen-max (480B),
the entire zen4-* generation (zen4/-mini/-pro/-max/-ultra/-thinking/-coder/
-coder-flash/-coder-pro), zen-reasoning.

INDEX.md (-> 59 papers) and PAPER_TIMELINE.md updated.

Co-Authored-By: Claude Opus 4.8 (1M context) <[email protected]>
@hanzo-dev hanzo-dev merged commit f3658ea into main Jun 17, 2026
1 check passed
@hanzo-dev hanzo-dev deleted the paper/honest-family-provenance branch June 17, 2026 17:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant