How to use this notebook
Explain why unbiased metagenomics is useful in a difficult CNS infection case.
In this case, that objective expands into the wider concept of clinical metagenomics:
- why unbiased sequencing can help when routine tests are unrevealing,
- how metagenomics supports clinical reasoning rather than replacing it,
- why sample type, host background, and classification uncertainty matter,
- and how to interpret both informative and non-informative results in a hospital setting.
- Start with the Diagnostic simulation activity below.
- Use the admission record and manual like a real hospital case discussion.
- Ask learners to commit to an early differential diagnosis before they see the metagenomic result.
- Then move into the direct metagenomic output section and the deeper analytical interpretation.
This notebook is designed to feel like a clinical reasoning exercise first and a bioinformatics notebook second. The goal is to let learners move from:
- syndrome recognition,
- to staged diagnostic refinement,
- to specimen-level metagenomic interpretation,
- to a final clinically defensible diagnosis.
Aim of this notebook
This is not a full replication study. Instead, it is a guided self-study notebook for healthcare workers learning how clinical metagenomics can support difficult diagnostic decisions. The notebook:
- reconstructs the diagnostic teaching context,
- inspects the locally available re-analysis outputs,
- shows which samples are informative and which are not,
- makes the role of unclassified signal explicit,
- compares the recreated results against the original paper,
- and connects the sequencing outputs back to clinical reasoning in a realistic hospital workflow.
How this notebook fits the teaching package
This notebook is designed as a single self-guided teaching resource. It does not separate student and instructor versions. Instead, it lets learners:
- read the clinical scenario,
- decide what information they need next,
- answer short questions before opening hints,
- reveal worked solutions only when needed,
- run or inspect metagenomic workflow outputs,
- and progressively connect sequencing evidence back to clinical reasoning.
The final diagnosis is still delayed until after learners have worked through the clinical and metagenomic clues.
Interpretive guardrails used throughout
- This notebook uses the local files available in the workshop directory; it does not reconstruct the full original UCSF SURPI workflow.
- The supplied workshop export files
nejm_with_unclassified.csv, nejm_without_unclassified.csv, and nejm.phyloseq/ retain only one sample after strict post-filtering.
- The broader four-sample diagnostic set is therefore reconstructed here from the underlying per-sample taxonomic summaries in
../materials/clinical-reasoning/Readbased_Analysis/.
- For abundance-based summaries, the main metric is
f_weighted_at_rank.
- The value called
unweighted_fraction is still useful, but it should not be treated as if it must behave like a conventional 100% compositional percentage after every filtering step.
- Diversity and ordination plots are shown only as descriptive teaching visuals. With four runs, they are not suitable for formal inference.
- AMR outputs are inspected because they are present, but they are secondary to the diagnostic question in this case.
- The final in-depth analytical material is retained near the end because it is most valuable for learners who want a research-style debrief after the clinical exercise.
Diagnostic simulation activity
This front section functions as a realistic diagnostic simulation for clinicians. Work through it sequentially before opening the later metagenomic interpretation.
Admission record
Hospital: Tertiary referral hospital, Ho Chi Minh City
Department: Emergency Department → Intensive Care Unit
Patient: 14-year-old male
Reason for admission: Fever, severe headache, altered mental status, and one generalized seizure
Background
The patient has a known history of X-linked agammaglobulinemia (XLA) and receives regular intravenous immunoglobulin replacement. He has no previous history of central nervous system infection.
History of present illness
He had been well until 6 days before admission, when he developed persistent fever, diffuse myalgia, and progressive frontal headache. He was initially treated at a district-level facility with supportive care and oral amoxicillin-clavulanate, without clear improvement.
Over the 48 hours before transfer, the headache became severe and was associated with photophobia, lethargy, and reduced interaction with family. On the day of admission, he had a generalized tonic-clonic seizure lasting approximately 4 minutes, prompting urgent transfer.
Examination on arrival
- Temperature: 39.5°C
- Heart rate: 128/min
- Blood pressure: 105/65 mmHg
- Respiratory rate: 22/min
- SpO2: 97% on room air
The patient is stuporous but withdraws to pain. Marked neck stiffness is present. Kernig and Brudzinski signs are positive. No petechiae, purpura, eschar, or visible jaundice are seen. Cardiorespiratory examination is otherwise unremarkable.
Initial laboratory results
- WBC: 16,500/µL, neutrophil predominant
- Hemoglobin: 13.2 g/dL
- Platelets: 115,000/µL
- CRP: elevated
- Creatinine: normal
- AST: mildly elevated
- ALT: mildly elevated
- Total bilirubin: normal
First discussion task
Discuss with your group:
- What syndromes are present at admission?
- What are the most likely diagnostic categories at this stage?
- What information would you request next from the manual?
Before the pipeline result
Write down:
- your top three diagnostic hypotheses,
- one diagnosis that has become less likely,
- and the single most important unanswered question in the case.
At this point, strong answers usually stay at the level of diagnostic categories or syndromic possibilities rather than jumping too early to one exact organism.
Back to top