Honest media-paper provenance: attribute real upstreams, withdraw fabricated#3
Merged
Merged
Conversation
…ricated Rewrite 5 media whitepapers to truthfully attribute their permissively-licensed upstreams (dropping the fabricated "Zen MoDE" architecture + invented benchmarks): - zen-3d -> Microsoft TRELLIS (MIT) - zen-video-i2v -> Alibaba Wan2.2-I2V-A14B (Apache-2.0) - zen-world -> Alibaba Wan2.1-T2V-14B (Apache-2.0) - zen-director -> Alibaba Wan2.2-TI2V-5B (Apache-2.0) - zen-musician -> M-A-P YuE (Apache-2.0) Withdraw 4 fabricated/discontinued papers: - zen-mixture-of-experts (the fabricated "Zen MoDE" architecture) - zen-voyager (mislabeled Qwen3-32B LLM; repo deleted) - zen-foley (no permissively-licensed foley model exists) - zen-video (deleted Tencent-mix repo) Drop the withdrawn papers from INDEX.md and PAPER_TIMELINE.md. Co-Authored-By: Claude Opus 4.8 (1M context) <[email protected]>
hanzo-dev
added a commit
that referenced
this pull request
Jun 17, 2026
The non-media zen paper corpus fabricated a homegrown "Zen MoDE (Mixture of Distilled Experts)" architecture + inflated benchmarks for what are off-the-shelf forks. Rewrite/withdraw all 65 to be honest (companion to the media PR #3). Rewritten (~45) to attribute the REAL upstream + license, drop "Zen MoDE" + fabricated benchmarks: - Flagship LLMs (zen-base/pro/family-overview/3-nano) -> Qwen3-8B (false 72B/0.6B corrected) - Multimodal (zen3-omni/3-vl/vl) -> Qwen3-Omni / Qwen3-VL (false 72B -> ~31B MoE) - Retrieval/safety: zen3-embedding/reranker -> Qwen3-Embedding/Reranker; zen3-guard -> IBM Granite Guardian; zen-guard-gen -> Qwen2.5-7B; zen-guard-stream -> Qwen2.5-3B (Qwen Research License = NON-commercial; flagged) - Speech/dub: zen-scribe -> Qwen3-ASR; zen-voice-clone -> Qwen3-TTS; zen-dub -> Qwen3-TTS + MuseTalk; zen-dub-live/zen-live -> Qwen3-Omni - Coder: zen-coder -> Qwen-Coder (deleted fake "Zen Agentic Dataset"); zen5 -> DeepSeek-V4-Flash - Domain (legal/financial/medical) -> Qwen3-8B finetunes (fabricated benchmarks dropped) - zen-designer-instruct/thinking -> Qwen3-VL-235B-A22B (verified real repos) - Method/arch papers -> stripped "Zen MoDE" + phantom 480B/1T; techniques kept; tables marked illustrative Withdrawn (11; no backing model / fabricated frontier scale): zen-max (480B), the entire zen4-* generation (zen4/-mini/-pro/-max/-ultra/-thinking/-coder/ -coder-flash/-coder-pro), zen-reasoning. INDEX.md (-> 59 papers) and PAPER_TIMELINE.md updated. Co-authored-by: Claude Opus 4.8 (1M context) <[email protected]>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Aligns the media whitepapers with the cleaned-up, permissively-licensed model repos — the same honesty fix we applied to the HF model repos: truthful upstream attribution + real licenses, and removal of the fabricated "Zen MoDE" architecture and invented benchmarks.
Rewritten (5) — now honestly attribute their real upstream + license
Each drops the fabricated "Zen MoDE (Mixture of Distilled Experts)" backbone and invented benchmarks; all quantitative claims are now cited to the upstream authors. All five compile.
Withdrawn (4) — fabricated or discontinued
INDEX.mdandPAPER_TIMELINE.mdupdated to drop the withdrawn papers.Out of scope (flagged)
The broader "Zen MoDE" narrative + fictional dates persist across the ~85 non-media papers (LLM/coder/etc.) and PAPER_TIMELINE.md's general framing. A family-wide honesty pass is a separate, larger effort (the "audit all ~90 papers" option), not attempted here.
🤖 Generated with Claude Code