Popular repositories Loading
-
scribegoat2
scribegoat2 PublicOpen-source medical LLM safety evaluation pipeline with reproducible benchmarks and high-risk clinical failure analysis.
-
lostbench
lostbench PublicStandalone benchmark for multi-turn safety persistence in medical LLM conversations. Measures recommendation monotonicity under sustained patient pressure.
Python
-
openem-corpus
openem-corpus PublicThe AI-native emergency medicine knowledge base. Agent-compiled, physician-verified, grep-friendly.
Python
-
safeshift
safeshift PublicDoes making the model faster make it less safe? Safety degradation benchmarking under inference optimization.
Python
-
radslice
radslice PublicMultimodal radiology LLM benchmark across CT, MRI, X-ray, and Ultrasound
Python
Repositories
- medomni Public
Sovereign nurse-first medical-LLM stack on NVIDIA's open-component stack — held-out 0.369 mean (N=30), manifest-locked reproducibility
GOATnote-Inc/medomni’s past year of commit activity - healthcraft Public
HEALTHCRAFT RL Training Environment: adapts the CORECRAFT architecture to emergency medicine
GOATnote-Inc/healthcraft’s past year of commit activity - receipts Public
Append-only intent-vs-execution attestation ledger — Engineering Receipts + Clinical Audit Ledger. Audit-grade lineage with Merkle hash chain, kappa-graded dual-judge, FHIR R4 attestation extensions.
GOATnote-Inc/receipts’s past year of commit activity - medimage-corpus Public
Registry of 134 open-source medical imaging datasets (CT, X-ray, MRI, ultrasound, VLM-paired) with manifests, download dispatchers, and format converters for vision/VLM training.
GOATnote-Inc/medimage-corpus’s past year of commit activity - prism42 Public
Managed Agents harness on Claude Opus 4.7 for kernel correctness research and clinical reasoning auditing
GOATnote-Inc/prism42’s past year of commit activity - lostbench Public
Standalone benchmark for multi-turn safety persistence in medical LLM conversations. Measures recommendation monotonicity under sustained patient pressure.
GOATnote-Inc/lostbench’s past year of commit activity - MedAgentBench Public Forked from stanfordmlgroup/MedAgentBench
MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents
GOATnote-Inc/MedAgentBench’s past year of commit activity - scribegoat2 Public
Open-source medical LLM safety evaluation pipeline with reproducible benchmarks and high-risk clinical failure analysis.
GOATnote-Inc/scribegoat2’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…