Instructor Notes
General Setup
Before the workshop:
- Verify all training accounts (
trainXX) can access both Negishi and Gilbreth clusters - Confirm the shared data is available at
/depot/workshop/data/annotation_workshop/ - Ensure
biocontainersmodule loads correctly and all tool modules are accessible - Pre-stage the GeneMark license key for learners or provide download instructions
- Test that GPU allocation works on Gilbreth for the Helixer episode
Timing Guide
| Episode | Teaching | Exercises | Notes |
|---|---|---|---|
| Introduction | 20 min | 0 min | Conceptual, no hands-on |
| Annotation Strategies | 20 min | 5 min | Conceptual with exercises |
| Annotation Setup | 15 min | 5 min | Walk through only; steps are pre-computed |
| Gene Annotation with BRAKER | 40 min | 10 min | Submit jobs early; review results while waiting |
| Gene Annotation with Helixer | 20 min | 5 min | Requires Gilbreth cluster; switch clusters during break |
| Gene Annotation with EASEL | 25 min | 5 min | Optional/self-guided if time is short |
| Functional Annotation with EnTAP | 20 min | 5 min | Databases are pre-staged |
| Annotation Assessment | 30 min | 10 min | Compare all methods together |
Common Learner Issues
Cluster Access
- Learners may forget to switch to Gilbreth for the Helixer episode. Remind them during the break before that session.
- SSH key setup can be tricky on Windows. Have learners test access during the Arrival & Setup period.
Module Loading
- Always use
ml --force purgebefore loading new modules. Without--force, sticky modules may cause conflicts. - If
biocontainersfails to load, check that the user’s.bashrcdoes not have conflicting module commands.
BRAKER3
- The GeneMark license key must be in
~/.gm_key. If learners get a “license key not found” error, have them re-copy the key. - AUGUSTUS config must be writable. Ensure the
copy_augustus_configstep completed successfully. - Jobs take 7-170 minutes depending on the case. Submit all 5 cases at the start and check results later.
Helixer
- GPU availability on Gilbreth may be limited. Use
--account=standbyfor opportunistic scheduling. - Model download to
~/.local/share/Helixercan be slow. Consider pre-staging models.
EASEL
- Nextflow version matters. The custom installation step avoids conflicts with the system module.
- The
rcac.configmust be correctly placed and referenced innextflow.config. - If the pipeline fails, check
.nextflow.logand thework/directory for process-level logs.
Challenge Exercise Answers
Answers to all challenge exercises are included in the
:::::: solution blocks within each episode. Review these
before the workshop to anticipate follow-up questions.