alphabase.psm_reader.pfind_reader#

Functions:

`convert_one_pFind_mod`(mod)
`get_pFind_mods`(pfind_mod_str)
`parse_pfind_protein`(protein[, keep_reverse])
`translate_pFind_mod`(mod_str)

Classes:

pFindReader(*[, column_mapping, ...])

alphabase.psm_reader.pfind_reader.convert_one_pFind_mod(mod)[source][source]#

alphabase.psm_reader.pfind_reader.get_pFind_mods(pfind_mod_str)[source][source]#

class alphabase.psm_reader.pfind_reader.pFindReader(*, column_mapping: dict = None, modification_mapping: dict = None, fdr=0.01, keep_decoy=False, **kwargs)[source][source]#

Bases: PSMReaderBase

Methods:

__init__(*[, column_mapping, ...])

The Base class for all PSMReaders.

__init__(*, column_mapping: dict = None, modification_mapping: dict = None, fdr=0.01, keep_decoy=False, **kwargs)[source][source]#

The Base class for all PSMReaders. The key of the sub-classes for different search engine format is to re-define column_mapping and modification_mapping.

Parameters:

column_mapping (dict, optional) – A dict that maps alphabase’s columns to other search engine’s. The key of the column_mapping is alphabase’s column name, and the value could be the column name or a list of column names in other engine’s result. If it is None, this dict will be init by self._init_column_mapping. The dict values could be either str or list, for exaplme: ` columns_mapping = { 'sequence': 'NakedSequence', #str 'charge': 'Charge', #str 'proteins':['Proteins','UniprotIDs'], # list, this reader will automatically detect all of them. } ` Defaults to None.
modification_mapping (dict, optional) – A dict that maps alphabase’s modifications to other engine’s. If it is None, this dict will be init by default modification mapping for each search engine (see psm_reader_yaml). The dict values can be either str or list, for exaplme: ` modification_mapping = { 'Oxidation@M': 'Oxidation (M)', # str 'Phospho@S': ['S(Phospho (STY))','S(ph)','pS'], # list, this reader will automatically detect all of them. } ` Defaults to None.
fdr (float, optional) – FDR level to keep PSMs. Defaults to 0.01.
keep_decoy (bool, optional) – If keep decoy PSMs in self.psm_df. Defautls to False.

column_mapping#

Dict structure same as column_mapping in Args.

Type:: dict

modification_mapping#

Dict structure same as modification_mapping in Args. We must use self.set_modification_mapping(new_mapping) to update it.

Type:: dict

_psm_df#

the PSM DataFrame after loading from search engines.

Type:: pd.DataFrame

psm_df#

the getter of self._psm_df

Type:: pd.DataFrame

keep_fdr#

The only PSMs with FDR<=keep_fdr were returned in self._psm_df.

Type:: float

keep_decoy#

If keep decoy PSMs in self.psm_df.

Type:: bool

_min_max_rt_norm#

if True, the ‘rt_norm’ values in self._psm_df will be normalized by rt_norm = (self.psm_df.rt-rt_min)/(rt_max-rt_min). It is useful to normalize iRT values as they contain negative values. Defaults to False.

Type:: bool

alphabase.psm_reader.pfind_reader.parse_pfind_protein(protein, keep_reverse=True)[source][source]#

alphabase.psm_reader.pfind_reader.translate_pFind_mod(mod_str)[source][source]#