Skip to content

Honest media-paper provenance: attribute real upstreams, withdraw fabricated#3

Merged
hanzo-dev merged 1 commit into
mainfrom
paper/honest-media-provenance
Jun 17, 2026
Merged

Honest media-paper provenance: attribute real upstreams, withdraw fabricated#3
hanzo-dev merged 1 commit into
mainfrom
paper/honest-media-provenance

Conversation

@hanzo-dev

Copy link
Copy Markdown
Contributor

Aligns the media whitepapers with the cleaned-up, permissively-licensed model repos — the same honesty fix we applied to the HF model repos: truthful upstream attribution + real licenses, and removal of the fabricated "Zen MoDE" architecture and invented benchmarks.

Rewritten (5) — now honestly attribute their real upstream + license

  • zen-3d → Microsoft TRELLIS (MIT) — image-to-3D via Structured LATents
  • zen-video-i2v → Alibaba Wan2.2-I2V-A14B (Apache-2.0)
  • zen-world → Alibaba Wan2.1-T2V-14B (Apache-2.0)
  • zen-director → Alibaba Wan2.2-TI2V-5B (Apache-2.0)
  • zen-musician → M-A-P YuE (Apache-2.0)

Each drops the fabricated "Zen MoDE (Mixture of Distilled Experts)" backbone and invented benchmarks; all quantitative claims are now cited to the upstream authors. All five compile.

Withdrawn (4) — fabricated or discontinued

  • zen-mixture-of-experts — the fabricated "Zen MoDE" architecture the family falsely cited
  • zen-voyager — was a mislabeled Qwen3-32B LLM; repo deleted
  • zen-foley — no permissively-licensed video→audio foley model exists (honest gap)
  • zen-video — deleted Tencent-mix repo (superseded by zen-video-i2v / zen-world)

INDEX.md and PAPER_TIMELINE.md updated to drop the withdrawn papers.

Out of scope (flagged)

The broader "Zen MoDE" narrative + fictional dates persist across the ~85 non-media papers (LLM/coder/etc.) and PAPER_TIMELINE.md's general framing. A family-wide honesty pass is a separate, larger effort (the "audit all ~90 papers" option), not attempted here.

🤖 Generated with Claude Code

…ricated

Rewrite 5 media whitepapers to truthfully attribute their permissively-licensed
upstreams (dropping the fabricated "Zen MoDE" architecture + invented benchmarks):
- zen-3d         -> Microsoft TRELLIS (MIT)
- zen-video-i2v  -> Alibaba Wan2.2-I2V-A14B (Apache-2.0)
- zen-world      -> Alibaba Wan2.1-T2V-14B (Apache-2.0)
- zen-director   -> Alibaba Wan2.2-TI2V-5B (Apache-2.0)
- zen-musician   -> M-A-P YuE (Apache-2.0)

Withdraw 4 fabricated/discontinued papers:
- zen-mixture-of-experts (the fabricated "Zen MoDE" architecture)
- zen-voyager (mislabeled Qwen3-32B LLM; repo deleted)
- zen-foley (no permissively-licensed foley model exists)
- zen-video (deleted Tencent-mix repo)

Drop the withdrawn papers from INDEX.md and PAPER_TIMELINE.md.

Co-Authored-By: Claude Opus 4.8 (1M context) <[email protected]>
@hanzo-dev hanzo-dev merged commit ffc4006 into main Jun 17, 2026
1 check passed
@hanzo-dev hanzo-dev deleted the paper/honest-media-provenance branch June 17, 2026 05:24
hanzo-dev added a commit that referenced this pull request Jun 17, 2026
The non-media zen paper corpus fabricated a homegrown "Zen MoDE (Mixture of
Distilled Experts)" architecture + inflated benchmarks for what are off-the-shelf
forks. Rewrite/withdraw all 65 to be honest (companion to the media PR #3).

Rewritten (~45) to attribute the REAL upstream + license, drop "Zen MoDE" +
fabricated benchmarks:
- Flagship LLMs (zen-base/pro/family-overview/3-nano) -> Qwen3-8B (false 72B/0.6B corrected)
- Multimodal (zen3-omni/3-vl/vl) -> Qwen3-Omni / Qwen3-VL (false 72B -> ~31B MoE)
- Retrieval/safety: zen3-embedding/reranker -> Qwen3-Embedding/Reranker;
  zen3-guard -> IBM Granite Guardian; zen-guard-gen -> Qwen2.5-7B;
  zen-guard-stream -> Qwen2.5-3B (Qwen Research License = NON-commercial; flagged)
- Speech/dub: zen-scribe -> Qwen3-ASR; zen-voice-clone -> Qwen3-TTS;
  zen-dub -> Qwen3-TTS + MuseTalk; zen-dub-live/zen-live -> Qwen3-Omni
- Coder: zen-coder -> Qwen-Coder (deleted fake "Zen Agentic Dataset"); zen5 -> DeepSeek-V4-Flash
- Domain (legal/financial/medical) -> Qwen3-8B finetunes (fabricated benchmarks dropped)
- zen-designer-instruct/thinking -> Qwen3-VL-235B-A22B (verified real repos)
- Method/arch papers -> stripped "Zen MoDE" + phantom 480B/1T; techniques kept; tables marked illustrative

Withdrawn (11; no backing model / fabricated frontier scale): zen-max (480B),
the entire zen4-* generation (zen4/-mini/-pro/-max/-ultra/-thinking/-coder/
-coder-flash/-coder-pro), zen-reasoning.

INDEX.md (-> 59 papers) and PAPER_TIMELINE.md updated.

Co-authored-by: Claude Opus 4.8 (1M context) <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant