Honest provenance: family-wide non-media paper rewrite/withdraw (~45 rewritten, 11 withdrawn)#4
Merged
Merged
Conversation
The non-media zen paper corpus fabricated a homegrown "Zen MoDE (Mixture of Distilled Experts)" architecture + inflated benchmarks for what are off-the-shelf forks. Rewrite/withdraw all 65 to be honest (companion to the media PR #3). Rewritten (~45) to attribute the REAL upstream + license, drop "Zen MoDE" + fabricated benchmarks: - Flagship LLMs (zen-base/pro/family-overview/3-nano) -> Qwen3-8B (false 72B/0.6B corrected) - Multimodal (zen3-omni/3-vl/vl) -> Qwen3-Omni / Qwen3-VL (false 72B -> ~31B MoE) - Retrieval/safety: zen3-embedding/reranker -> Qwen3-Embedding/Reranker; zen3-guard -> IBM Granite Guardian; zen-guard-gen -> Qwen2.5-7B; zen-guard-stream -> Qwen2.5-3B (Qwen Research License = NON-commercial; flagged) - Speech/dub: zen-scribe -> Qwen3-ASR; zen-voice-clone -> Qwen3-TTS; zen-dub -> Qwen3-TTS + MuseTalk; zen-dub-live/zen-live -> Qwen3-Omni - Coder: zen-coder -> Qwen-Coder (deleted fake "Zen Agentic Dataset"); zen5 -> DeepSeek-V4-Flash - Domain (legal/financial/medical) -> Qwen3-8B finetunes (fabricated benchmarks dropped) - zen-designer-instruct/thinking -> Qwen3-VL-235B-A22B (verified real repos) - Method/arch papers -> stripped "Zen MoDE" + phantom 480B/1T; techniques kept; tables marked illustrative Withdrawn (11; no backing model / fabricated frontier scale): zen-max (480B), the entire zen4-* generation (zen4/-mini/-pro/-max/-ultra/-thinking/-coder/ -coder-flash/-coder-pro), zen-reasoning. INDEX.md (-> 59 papers) and PAPER_TIMELINE.md updated. Co-Authored-By: Claude Opus 4.8 (1M context) <[email protected]>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Companion to #3 (media papers). Extends the honesty fix to the 65 non-media papers: the corpus fabricated a homegrown "Zen MoDE (Mixture of Distilled Experts)" architecture + inflated benchmarks for models that are off-the-shelf forks (verified by HF config fingerprinting).
Rewritten (~45) — attribute real upstream + license, drop "Zen MoDE" + fabricated benchmarks
zen-base/pro/family-overview/3-nano) → Qwen3-8B (false "72B"/"0.6B" corrected)zen3-omni/3-vl/vl) → Qwen3-Omni / Qwen3-VL (false "72B" → real ~31B MoE)zen3-embedding/reranker→ Qwen3-Embedding/Reranker;zen3-guard→ IBM Granite Guardian;zen-guard-gen→ Qwen2.5-7B;zen-guard-stream→ Qwen2.5-3Bzen-scribe→ Qwen3-ASR;zen-voice-clone→ Qwen3-TTS;zen-dub→ Qwen3-TTS+MuseTalk;zen-dub-live/zen-live→ Qwen3-Omnizen-coder→ Qwen-Coder (deleted fabricated "Zen Agentic Dataset");zen5→ DeepSeek-V4-Flashzen-designer-*→ Qwen3-VL-235B-A22B (verified real repos)Withdrawn (11) — no backing model (vaporware / fabricated frontier scale)
zen-max(480B), the entirezen4-*generation (zen4/-mini/-pro/-max/-ultra/-thinking/-coder/-coder-flash/-coder-pro),zen-reasoning.INDEX.md(→ 59 papers) +PAPER_TIMELINE.mdupdated. All rewritten papers compile.zenlm/zen-guard-streamis built on Qwen2.5-3B = non-commercial Qwen Research License (not Apache). The model repo's license tag likely needs correction, like the media cleanup.🤖 Generated with Claude Code