TLimmuno2 predicts whether a peptide presented by an MHC class II molecule will trigger a CD4+ T cell immune response. Where most immunogenicity tools focus on MHC class I (CD8+ T cells), TLimmuno2 addresses the less-covered class II pathway, which is central to helper T cell activation, vaccine design, and cancer neoantigen identification.
The model uses transfer learning: an LSTM network is first trained on over 100,000 peptide-MHC binding affinity measurements, then fine-tuned on immunogenicity data. This two-stage approach compensates for the limited amount of experimentally validated immunogenicity data available for MHC-II epitopes.
TLimmuno2 runs two neural networks in sequence:
Binding affinity model (BAmodel): An LSTM trained on 107,008 binding measurements from NetMHCIIpan across 71 MHC-II molecules. Rather than using the final binding prediction, TLimmuno2 extracts intermediate features from this model's penultimate layer, capturing learned representations of peptide-MHC interaction patterns.
Immunogenicity model: A second LSTM that takes three inputs — the BLOSUM62-encoded peptide, the encoded MHC pseudosequence, and the binding affinity features from stage one — and predicts the probability that the complex will elicit a T cell response.
Both peptides and MHC pseudosequences are encoded using the BLOSUM62 substitution matrix, which captures biochemical similarity between amino acids. Peptides are padded to 21 residues and MHC pseudosequences to 34 residues, producing fixed-size matrices that the LSTM layers can process.
Raw immunogenicity scores lack context without a reference distribution. When percentile ranking is enabled, TLimmuno2 scores approximately 90,000 random human peptides (sampled across lengths 13–21) against the same HLA allele and reports where the query peptide falls in that distribution. A percentile rank of 0.95 means the peptide scores higher than 95% of background peptides for that allele.
ProteinIQ provides cloud-hosted access to TLimmuno2 with support for over 2,000 HLA class II alleles, covering the DRB1, DRB3, DRB4, DRB5, and DPA/DPB loci. No installation or Python environment required.
| Input | Description |
|---|---|
Peptide Sequences | One or more peptide sequences in FASTA format or one per line. MHC-II epitopes are typically 13–25 amino acids, though sequences from 9–50 residues are accepted. |
| Setting | Description |
|---|---|
HLA assignment mode | Single HLA for all peptides applies one allele to every input sequence. One HLA per peptide allows specifying a different allele for each sequence. |
HLA allele | The MHC-II allele in underscore format (e.g., DRB1_0101, DRB1_0401, DRB5_0101). Used when assignment mode is set to single. |
Per-sequence HLA alleles | One allele per line, matching the order of input peptides. Required when using per-sequence mode. |
Immunogenicity threshold | Score cutoff for the immunogenic/non-immunogenic label (0–1, default 0.5). Peptides scoring at or above the threshold are labeled immunogenic. |
Include percentile ranking |
| Column | Description |
|---|---|
Immunogenicity Score | Predicted probability of triggering a CD4+ T cell response (0–1). Higher scores indicate greater immunogenic potential. |
Percentile Rank | Position relative to background peptides for the same HLA allele (0–1). Only populated when percentile ranking is enabled. |
Label | Binary classification based on the chosen threshold: immunogenic or non-immunogenic. |
The immunogenicity score is a continuous probability. A peptide scoring 0.8 is not necessarily four times as immunogenic as one scoring 0.2 — the score reflects model confidence, not magnitude of immune response.
| Score range | Interpretation |
|---|---|
| > 0.7 | Strong predicted immunogenicity |
| 0.4–0.7 | Moderate — experimental validation recommended |
| < 0.4 | Low predicted immunogenicity |
Percentile rank provides additional context. A peptide with a moderate score of 0.5 but a high percentile rank (> 0.9) is still notable — it scores above 90% of random peptides for that allele, suggesting genuine immunogenic potential even if the absolute score appears modest.
MHC class I molecules present intracellular peptides (typically 8–11 residues) to CD8+ cytotoxic T cells. MHC class II molecules present extracellular peptides (typically 13–25 residues) to CD4+ helper T cells. The two pathways involve different antigen processing machinery, binding groove geometry, and downstream immune effects.
TLimmuno2 specifically models the class II pathway. For MHC class I immunogenicity prediction, DeepImmuno covers 20 HLA-A/B/C alleles with a CNN-based approach.
| Ranks each peptide against ~90,000 background peptides per HLA allele. Off by default since it adds roughly 3 minutes per unique HLA allele. |