alphabase.psm_reader.msfragger_reader
MSFragger reader.
Classes:
MSFraggerModificationTranslator | Translate MSFragger PSM.TSV modifications to alphabase format. |
MSFraggerPepXML | Deprecated. |
MSFraggerPepXMLReader | Reader for MSFragger's pep.xml file. |
MSFraggerPsmTsvReader | Reader for MSFragger's psm.tsv file. |
Functions:
Register MSFragger readers. |
- class alphabase.psm_reader.msfragger_reader.MSFraggerModificationTranslator(mass_mapped_mods: List[str], mod_mass_tol: float, rev_mod_mapping: Dict[str, str])
Bases: object
Translate MSFragger PSM.TSV modifications to alphabase format.
Methods:
__init__(mass_mapped_mods, mod_mass_tol, ...) | Initialize MSFragger modification translator. |
translate(psm_df) | Translate modifications from MSFragger assigned modifications. |
- __init__(mass_mapped_mods: List[str], mod_mass_tol: float, rev_mod_mapping: Dict[str, str])
Initialize MSFragger modification translator.
- Parameters:
mass_mapped_mods (List[str]) – List of modification names to match against (e.g., [‘Phospho@S’, ‘Oxidation@M’])
mod_mass_tol (float) – Mass tolerance for matching modifications in Daltons.
rev_mod_mapping (Dict[str, str]) – Reverse mapping from MSFragger format to alphabase format. Keys use MSFragger’s native format: ‘AA(mass)’ or ‘N-term(mass)’. Values use alphabase format: ‘Mod@AA’.
- translate(psm_df: DataFrame) -> DataFrame
Translate modifications from MSFragger assigned modifications.
- Parameters:
psm_df (pd.DataFrame) – DataFrame with PsmDfCols.TMP_MODS column containing raw assigned modifications strings
- Returns:
The input DataFrame with ‘mods’ and ‘mod_sites’ columns added
- Return type:
pd.DataFrame
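The translation described above can be illustrated with a minimal, self-contained sketch. It is not the alphabase implementation. It assumes that a raw assigned-modifications string looks like `5S(79.9663), N-term(304.2071)` (optional position, site, mass in parentheses), that matched modifications are joined with `;`, and that terminal sites are encoded as `0` (N-term) and `-1` (C-term). The helper name `translate_assigned_mods` is hypothetical.

```python
import re

# Assumed shape of one entry: optional position, site label, mass in parentheses,
# e.g. "5S(79.9663)" or "N-term(304.2071)".
_ENTRY = re.compile(r"^(\d*)([A-Za-z\-]+)\(([-+]?\d+(?:\.\d+)?)\)$")


def translate_assigned_mods(raw, rev_mod_mapping, mod_mass_tol=0.1):
    """Return (mods, mod_sites) strings for one raw assigned-modifications string.

    Hypothetical helper: rev_mod_mapping maps 'AA(mass)' / 'N-term(mass)' keys
    to 'Mod@AA' names; entries without a match within mod_mass_tol are skipped.
    """
    # Pre-parse the mapping keys, e.g. 'S(79.9663)' -> ('S', 79.9663, 'Phospho@S').
    known = []
    for key, name in rev_mod_mapping.items():
        parsed = _ENTRY.match(key)
        if parsed:
            known.append((parsed.group(2), float(parsed.group(3)), name))

    mods, sites = [], []
    for entry in filter(None, (e.strip() for e in raw.split(","))):
        parsed = _ENTRY.match(entry)
        if parsed is None:
            continue  # unparseable entry: ignored in this sketch
        pos, site, mass = parsed.group(1), parsed.group(2), float(parsed.group(3))
        for known_site, known_mass, name in known:
            # Match on the same site and a mass difference within the tolerance (Da).
            if known_site == site and abs(known_mass - mass) <= mod_mass_tol:
                mods.append(name)
                # Assumption: 0 marks the N-terminus, -1 the C-terminus.
                sites.append(pos if pos else ("0" if site == "N-term" else "-1"))
                break
    return ";".join(mods), ";".join(sites)


mapping = {"S(79.9663)": "Phospho@S", "N-term(304.2071)": "TMTpro@Any_N-term"}
mods, mod_sites = translate_assigned_mods("5S(79.9663), N-term(304.2071)", mapping)
```

Matching by site and mass within a tolerance, not by exact string, lets small rounding differences in the reported mass still resolve to the same modification name.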
- class alphabase.psm_reader.msfragger_reader.MSFraggerPepXML(*args, **kwargs)
Bases: MSFraggerPepXMLReader
Deprecated.
Methods:
__init__(*args, **kwargs) | Deprecated. |
- class alphabase.psm_reader.msfragger_reader.MSFraggerPepXMLReader(*, column_mapping: dict | None = None, modification_mapping: dict | None = None, fdr: float = 0.001, keep_decoy: bool = False, rt_unit: str | None = None, keep_unknown_aa_mass_diffs: bool = False, **kwargs)
Bases: PSMReaderBase
Reader for MSFragger’s pep.xml file.
Methods:
__init__(*[, column_mapping, ...]) | Initialize the MSFragger reader. |
- __init__(*, column_mapping: dict | None = None, modification_mapping: dict | None = None, fdr: float = 0.001, keep_decoy: bool = False, rt_unit: str | None = None, keep_unknown_aa_mass_diffs: bool = False, **kwargs)
Initialize the MSFragger reader.
See documentation of PSMReaderBase for more information.
MSFragger is not fully supported because this reader only has access to the pep.xml file.
- Parameters:
keep_unknown_aa_mass_diffs – whether to keep PSMs with unknown amino acid mass differences, default: False
See documentation of PSMReaderBase for the rest of parameters.
- class alphabase.psm_reader.msfragger_reader.MSFraggerPsmTsvReader(*, column_mapping: dict | None = None, modification_mapping: dict | None = None, fdr: float = 0.01, keep_decoy: bool = False, rt_unit: str | None = None, **kwargs)
Bases: PSMReaderBase
Reader for MSFragger’s psm.tsv file.
Methods:
__init__(*[, column_mapping, ...]) | Initialize MSFragger PSM TSV reader. |
- __init__(*, column_mapping: dict | None = None, modification_mapping: dict | None = None, fdr: float = 0.01, keep_decoy: bool = False, rt_unit: str | None = None, **kwargs)
Initialize MSFragger PSM TSV reader.
- Parameters:
column_mapping (Optional[dict]) – Custom column name mapping.
modification_mapping (Optional[dict]) – Custom modification mapping from alphabase format to MSFragger format. Keys use alphabase format: ‘Mod@AA’. Values use MSFragger’s native format: ‘AA(mass)’ or ‘N-term(mass)’ or ‘C-term(mass)’. Example: {‘Phospho@S’: ‘S(79.9663)’, ‘TMTpro@Any_N-term’: ‘N-term(304.2071)’}
fdr (float) – False discovery rate threshold. Default: 0.01
keep_decoy (bool) – Whether to keep decoy hits. Default: False
rt_unit (Optional[str]) – Retention time unit.
**kwargs – Additional arguments passed to PSMReaderBase.
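The `modification_mapping` above runs from alphabase names to MSFragger's native strings, while the `rev_mod_mapping` of MSFraggerModificationTranslator runs the other way. A minimal sketch of how one could be derived from the other follows. It is an illustration under stated assumptions, not the library's code: the helper name `invert_modification_mapping` is hypothetical, and accepting a list of native strings per modification is an assumption that goes beyond the documented example.

```python
def invert_modification_mapping(modification_mapping):
    """Turn {'Mod@AA': 'AA(mass)'} into {'AA(mass)': 'Mod@AA'}.

    Hypothetical helper; values may be a single native string or, as an
    assumption made here, a list of native strings for the same modification.
    """
    rev_mod_mapping = {}
    for alphabase_name, native in modification_mapping.items():
        natives = [native] if isinstance(native, str) else list(native)
        for native_name in natives:
            # A later entry overwrites an earlier one if two modifications
            # share the same native string.
            rev_mod_mapping[native_name] = alphabase_name
    return rev_mod_mapping


custom_mapping = {
    "Phospho@S": "S(79.9663)",
    "TMTpro@Any_N-term": "N-term(304.2071)",
}
rev = invert_modification_mapping(custom_mapping)
```

The resulting dictionary has the key and value formats described for `rev_mod_mapping`: MSFragger's native format as keys and alphabase's `Mod@AA` format as values.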