{ "cells": [ { "cell_type": "markdown", "id": "d48b9454", "metadata": {}, "source": [ "# Protein Group readers" ] }, { "cell_type": "code", "execution_count": 1, "id": "3812811e", "metadata": { "tags": [ "hide-cell" ] }, "outputs": [], "source": [ "%reload_ext autoreload \n", "%autoreload 2 " ] }, { "cell_type": "code", "execution_count": 2, "id": "c0510071", "metadata": {}, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "/Users/lucas-diedrich/Documents/Projects/alphaX/alphabase/alphabase/alphabase/tools/data_downloader.py:4: DeprecationWarning: 'cgi' is deprecated and slated for removal in Python 3.13\n", " import cgi\n", "/Users/lucas-diedrich/Documents/Projects/alphaX/alphabase/alphabase/alphabase/tools/data_downloader.py:18: ImportWarning: Dependency 'progressbar' not installed. Download progress will not be displayed.\n", " warnings.warn(\n" ] } ], "source": [ "# Helper packages\n", "import io\n", "from copy import copy\n", "from typing import Literal, Optional\n", "\n", "import anndata as ad\n", "import numpy as np\n", "import pandas as pd\n", "\n", "# alphabase\n", "from alphabase.pg_reader import pg_reader_provider\n", "from alphabase.tools.data_downloader import DataShareDownloader" ] }, { "cell_type": "markdown", "id": "e17fe6eb", "metadata": {}, "source": [ "## Background \n", "\n", "The `alphabase.pg_reader` module provides a unifying interface **to read protein group (PG) tables** from different search engines and file formats. It is designed to be easy to use, and to provide a consistent output format in the form of `pandas.DataFrame`s, regardless of the input file format.\n", "\n", "### Introduction to protein group matrices\n", "\n", "Protein group matrices are the primary output for protein-level quantification in proteomics workflows. After search engines identify peptide spectrum matches (PSMs, see [PSM-reader tutorial](../nbs/psm_readers.ipynb)), they aggregate peptide-level evidence to infer protein-level abundances. These protein group tables represent a structured matrix that maps protein groups (features) to samples (observations), with estimated intensity values as entries.\n", "\n", "\n", "A minimal protein group table could look something like this:\n", "\n", "| proteins | sample_1 | sample_2 | sample_3 |\n", "|----------|----------|----------|----------|\n", "| P12345 | 1000.5 | 892.3 | 1150.7 |\n", "| Q67890 | 2500.1 | 2780.9 | 2340.2 |\n", "\n", "\n", "\n", "> 💡 Since some identified peptide sequences can match multiple proteins (such as isoforms or homologues), proteomics search engines typically handle this ambiguity by grouping these proteins into *protein groups* as features.\n", "\n", "\n", "In this example, protein P12345 has quantified intensities of 1000.5, 892.3, and 1150.7 in samples 1, 2, and 3 respectively.\n", "\n", "### Search engine outputs\n", "\n", "In reality, protein group tables are significantly more complex than this, as they contain additional feature-level information about the proteins (e.g., gene names, descriptions, alternative quantification methods), and the quantification (e.g., different intensity types like raw, LFQ quantification, iBAQ). This additional information can be valuable for downstream analyses, but also makes protein group tables a lot more difficult to work with, as the exact names and formats may differ between search engines, versions, and file formats.\n", "\n", "#### Unifying properties \n", "\n", "`alphabase` aligns the column names to a unified vocabulary, facilitating cross-engine comparisons. We can categorize protein group tables into several common types:\n", "\n", "**Type 1 — Minimal**: A basic features × samples matrix. Only intensity values are stored, with sample names as columns and protein groups as the index. *Example*: AlphaDIA.\n", "\n", "**Type 2 — Multiple Intensity Fields**: A wide matrix where each sample may appear multiple times with different quantification types (e.g., `SampleA_LFQ`, `SampleB_raw`). *Example*: AlphaPept.\n", "\n", "**Type 3 — Feature Metadata**: A features × samples matrix with one intensity value per sample, plus additional feature-level metadata columns (e.g., gene names, descriptions). *Example*: DIA-NN.\n", "\n", "**Type 4 — Combined**: A composite structure including both multiple intensity fields (Type 2) and feature-level metadata (Type 3). *Examples*: Spectronaut, MZTab, MaxQuant.\n" ] }, { "cell_type": "markdown", "id": "4547ae8c", "metadata": {}, "source": [ "## Code | Read and parse protein group tables\n", "\n", "The alphabase `pg_reader` module enables users to parse proteomics protein group reports to a dataframe for most common search engines with a single line of code via its `alphabase.pg_reader.pg_reader_provider` factory.\n", "\n", "\n", "All readers return a standardized pandas DataFrame with:\n", "- **Features as index**: Protein identifiers and metadata in the `pandas.DataFrame.Index`\n", "- **Samples as columns**: Sample/run identifiers as column index\n", "- **Intensity values**: Protein quantification data as `pandas.DataFrame.values`\n", "\n", "\n", "\n", "The readers **support different quantification methods** by matching regular expression patterns in the output tables and the **retrieval of desired metadata columns to standardized names**.\n", "\n", "\n", "The unified alphabase format enables seamless comparison and analysis across different search engines, facilitating:\n", "- Method comparison studies\n", "- Data integration workflows\n", "- Standardized downstream analysis pipelines" ] }, { "cell_type": "markdown", "id": "a433a22a", "metadata": {}, "source": [ "### Available readers \n", "\n", "\n", "`alphabase.pg_reader.pg_reader_provider` has registered reader classes for the most common proteomics search engines. A list of implemented readers can be accessed via its `reader_dict` property:" ] }, { "cell_type": "code", "execution_count": 3, "id": "d7eeeefd", "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Registered readers in alphabase:\n", "\t- alphadia\n", "\t- alphapept\n", "\t- diann\n", "\t- fragpipe\n", "\t- maxquant\n", "\t- mztab\n", "\t- spectronaut\n" ] } ], "source": [ "all_registered_readers = pg_reader_provider.reader_dict.keys()\n", "\n", "# Display all registered readers\n", "sep = \"\\n\\t- \"\n", "print(\"Registered readers in alphabase:\", sep.join(sorted(all_registered_readers)), sep=sep)" ] }, { "cell_type": "markdown", "id": "d8352b39", "metadata": {}, "source": [ "### Interact with the reader provider" ] }, { "cell_type": "code", "execution_count": null, "id": "48b68899", "metadata": {}, "outputs": [], "source": [ "def get_pg_matrix_example(output_dir: Optional[str] = None, search_engine: Literal[\"alphadia\", \"alphapept\", \"spectronaut\"] = \"alphadia\") -> str:\n", " \"\"\"Get example data for the tutorial\n", "\n", " The function downloads example data and stores it\n", " in `output_dir`, or, alternatively in a temporary directory\n", "\n", " Parameter\n", " ---------\n", " output_dir\n", " Output directory. If `None`, creates a temporary directory\n", "\n", " Returns\n", " -------\n", " File location\n", " \"\"\"\n", " EXAMPLE_URLS = {\n", " \"alphadia\": \"https://datashare.biochem.mpg.de/s/4AtCZassaUzRR8K\",\n", " \"alphapept\": \"https://datashare.biochem.mpg.de/s/6G6KHJqwcRPQiOO\",\n", " \"spectronaut\": \"https://datashare.biochem.mpg.de/s/2u7U03wvmQDVT4y\",\n", " }\n", "\n", " if search_engine not in EXAMPLE_URLS:\n", " raise KeyError(f\"{search_engine} not found, select one of {', '.join(EXAMPLE_URLS.keys())}\")\n", "\n", " if output_dir is None:\n", " from tempfile import tempdir\n", "\n", " output_dir = tempdir\n", "\n", " downloader = DataShareDownloader(url=EXAMPLE_URLS[search_engine], output_dir=output_dir)\n", "\n", " return downloader.download()" ] }, { "cell_type": "markdown", "id": "dded4cee", "metadata": {}, "source": [ "### Example 1 - AlphaDIA\n", "\n", "We demonstrate how to interact with protein group tables via alphabase based on a minimal example output of the AlphaDIA search engine. \n", "\n", "First, let's get some minimal example data for the AlphaDIA output. The example data represents a DIA run of 6 HeLA samples on the Orbitrap Astral. \n", "\n", "You can see that the output data contains the feature names in the column `pg` and the computed protein group intensities per sample in the remaining columns.\n" ] }, { "cell_type": "code", "execution_count": 5, "id": "7f4bc10f", "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "/var/folders/py/838_q5nd6594y27wbrpkhl3h0000gn/T/alphadia1.10.4__pg_matrix.tsv already exists (0.8597145080566406 MB)\n" ] }, { "data": { "text/html": [ "
| \n", " | pg | \n", "20231024_OA3_TiHe_ADIAMA_HeLa_200ng_Evo01_21min_F-40_iO_before_03 | \n", "20231024_OA3_TiHe_ADIAMA_HeLa_200ng_Evo01_21min_F-40_iO_before_02 | \n", "20231024_OA3_TiHe_ADIAMA_HeLa_200ng_Evo01_21min_F-40_iO_before_01 | \n", "20231023_OA3_TiHe_ADIAMA_HeLa_200ng_Evo01_21min_F-40_iO_after_03 | \n", "20231023_OA3_TiHe_ADIAMA_HeLa_200ng_Evo01_21min_F-40_iO_after_02 | \n", "20231023_OA3_TiHe_ADIAMA_HeLa_200ng_Evo01_21min_F-40_iO_after_01 | \n", "
|---|---|---|---|---|---|---|---|
| 0 | \n", "A0A024RBG1 | \n", "5.597816e+05 | \n", "6.285112e+05 | \n", "0.000000e+00 | \n", "3.153867e+05 | \n", "2.753702e+05 | \n", "4.505648e+05 | \n", "
| 1 | \n", "A0A024RBG1;Q9NZJ9 | \n", "1.331061e+06 | \n", "1.400360e+06 | \n", "1.551987e+06 | \n", "1.606095e+06 | \n", "1.464152e+06 | \n", "1.397026e+06 | \n", "
| 2 | \n", "A0A075B759;A0A075B767;P62937 | \n", "2.024742e+08 | \n", "8.552202e+06 | \n", "1.837425e+08 | \n", "1.674874e+08 | \n", "1.768245e+08 | \n", "1.595220e+08 | \n", "
| 3 | \n", "A0A096LP01 | \n", "6.355092e+05 | \n", "4.589410e+05 | \n", "4.184495e+05 | \n", "4.032932e+05 | \n", "2.317467e+05 | \n", "2.731363e+05 | \n", "
| 4 | \n", "A0A096LP49 | \n", "1.777069e+05 | \n", "1.387537e+05 | \n", "2.513601e+05 | \n", "1.296699e+05 | \n", "1.276095e+05 | \n", "1.623200e+05 | \n", "
| ... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
| 9359 | \n", "Q9Y6X3 | \n", "3.898963e+05 | \n", "4.353048e+05 | \n", "4.150456e+05 | \n", "5.069992e+05 | \n", "4.195746e+05 | \n", "3.675962e+05 | \n", "
| 9360 | \n", "Q9Y6X6 | \n", "1.869312e+05 | \n", "0.000000e+00 | \n", "0.000000e+00 | \n", "2.304623e+05 | \n", "2.421623e+05 | \n", "0.000000e+00 | \n", "
| 9361 | \n", "Q9Y6X9 | \n", "3.362758e+06 | \n", "3.395221e+06 | \n", "3.541975e+06 | \n", "2.704210e+06 | \n", "3.141519e+06 | \n", "2.995787e+06 | \n", "
| 9362 | \n", "Q9Y6Y0 | \n", "5.924220e+06 | \n", "6.183842e+06 | \n", "6.190598e+06 | \n", "6.025724e+06 | \n", "5.920595e+06 | \n", "6.754984e+06 | \n", "
| 9363 | \n", "Q9Y6Y8 | \n", "1.416146e+07 | \n", "1.424916e+07 | \n", "1.342342e+07 | \n", "1.345135e+07 | \n", "1.406395e+07 | \n", "1.349913e+07 | \n", "
9364 rows × 7 columns
\n", "| \n", " | 20231024_OA3_TiHe_ADIAMA_HeLa_200ng_Evo01_21min_F-40_iO_before_03 | \n", "20231024_OA3_TiHe_ADIAMA_HeLa_200ng_Evo01_21min_F-40_iO_before_02 | \n", "20231024_OA3_TiHe_ADIAMA_HeLa_200ng_Evo01_21min_F-40_iO_before_01 | \n", "20231023_OA3_TiHe_ADIAMA_HeLa_200ng_Evo01_21min_F-40_iO_after_03 | \n", "20231023_OA3_TiHe_ADIAMA_HeLa_200ng_Evo01_21min_F-40_iO_after_02 | \n", "20231023_OA3_TiHe_ADIAMA_HeLa_200ng_Evo01_21min_F-40_iO_after_01 | \n", "
|---|---|---|---|---|---|---|
| uniprot_ids | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
| A0A024RBG1 | \n", "5.597816e+05 | \n", "6.285112e+05 | \n", "0.000000e+00 | \n", "3.153867e+05 | \n", "2.753702e+05 | \n", "4.505648e+05 | \n", "
| A0A024RBG1;Q9NZJ9 | \n", "1.331061e+06 | \n", "1.400360e+06 | \n", "1.551987e+06 | \n", "1.606095e+06 | \n", "1.464152e+06 | \n", "1.397026e+06 | \n", "
| A0A075B759;A0A075B767;P62937 | \n", "2.024742e+08 | \n", "8.552202e+06 | \n", "1.837425e+08 | \n", "1.674874e+08 | \n", "1.768245e+08 | \n", "1.595220e+08 | \n", "
| A0A096LP01 | \n", "6.355092e+05 | \n", "4.589410e+05 | \n", "4.184495e+05 | \n", "4.032932e+05 | \n", "2.317467e+05 | \n", "2.731363e+05 | \n", "
| A0A096LP49 | \n", "1.777069e+05 | \n", "1.387537e+05 | \n", "2.513601e+05 | \n", "1.296699e+05 | \n", "1.276095e+05 | \n", "1.623200e+05 | \n", "
| ... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
| Q9Y6X3 | \n", "3.898963e+05 | \n", "4.353048e+05 | \n", "4.150456e+05 | \n", "5.069992e+05 | \n", "4.195746e+05 | \n", "3.675962e+05 | \n", "
| Q9Y6X6 | \n", "1.869312e+05 | \n", "0.000000e+00 | \n", "0.000000e+00 | \n", "2.304623e+05 | \n", "2.421623e+05 | \n", "0.000000e+00 | \n", "
| Q9Y6X9 | \n", "3.362758e+06 | \n", "3.395221e+06 | \n", "3.541975e+06 | \n", "2.704210e+06 | \n", "3.141519e+06 | \n", "2.995787e+06 | \n", "
| Q9Y6Y0 | \n", "5.924220e+06 | \n", "6.183842e+06 | \n", "6.190598e+06 | \n", "6.025724e+06 | \n", "5.920595e+06 | \n", "6.754984e+06 | \n", "
| Q9Y6Y8 | \n", "1.416146e+07 | \n", "1.424916e+07 | \n", "1.342342e+07 | \n", "1.345135e+07 | \n", "1.406395e+07 | \n", "1.349913e+07 | \n", "
9364 rows × 6 columns
\n", "| \n", " | Unnamed: 0 | \n", "A_LFQ | \n", "B_LFQ | \n", "A | \n", "B | \n", "
|---|---|---|---|---|---|
| 0 | \n", "sp|P36578|RL4_HUMAN | \n", "4.669329e+08 | \n", "4.844083e+08 | \n", "4.452735e+08 | \n", "5.060678e+08 | \n", "
| 1 | \n", "sp|Q9P258|RCC2_HUMAN | \n", "4.074842e+08 | \n", "4.138132e+08 | \n", "4.177856e+08 | \n", "4.035118e+08 | \n", "
| 2 | \n", "sp|O60518|RNBP6_HUMAN | \n", "4.960386e+06 | \n", "2.022553e+06 | \n", "1.295621e+06 | \n", "5.687318e+06 | \n", "
| 3 | \n", "sp|P55036|PSMD4_HUMAN | \n", "1.157420e+08 | \n", "1.123571e+08 | \n", "1.130880e+08 | \n", "1.150112e+08 | \n", "
| 4 | \n", "sp|A1X283|SPD2B_HUMAN | \n", "1.247112e+07 | \n", "1.180582e+07 | \n", "1.380177e+07 | \n", "1.047516e+07 | \n", "
| ... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
| 3776 | \n", "sp|Q14966|ZN638_HUMAN | \n", "NaN | \n", "1.139844e+06 | \n", "NaN | \n", "1.139844e+06 | \n", "
| 3777 | \n", "sp|P84095|RHOG_HUMAN | \n", "NaN | \n", "9.466796e+05 | \n", "NaN | \n", "9.466796e+05 | \n", "
| 3778 | \n", "sp|Q99766|ATP5S_HUMAN | \n", "NaN | \n", "3.577785e+05 | \n", "NaN | \n", "3.577785e+05 | \n", "
| 3779 | \n", "sp|O14925|TIM23_HUMAN,sp|Q5SRD1|TI23B_HUMAN | \n", "NaN | \n", "9.237994e+05 | \n", "NaN | \n", "9.237994e+05 | \n", "
| 3780 | \n", "sp|P51946|CCNH_HUMAN | \n", "NaN | \n", "9.278844e+05 | \n", "NaN | \n", "9.278844e+05 | \n", "
3781 rows × 5 columns
\n", "| \n", " | \n", " | \n", " | \n", " | \n", " | A | \n", "B | \n", "
|---|---|---|---|---|---|---|
| proteins | \n", "uniprot_ids | \n", "ensembl_ids | \n", "source_db | \n", "is_decoy | \n", "\n", " | \n", " |
| RL4_HUMAN | \n", "P36578 | \n", "na | \n", "sp | \n", "False | \n", "445273477.0318756 | \n", "506067774.6891948 | \n", "
| RCC2_HUMAN | \n", "Q9P258 | \n", "na | \n", "sp | \n", "False | \n", "417785611.6324583 | \n", "403511752.8857417 | \n", "
| RNBP6_HUMAN | \n", "O60518 | \n", "na | \n", "sp | \n", "False | \n", "1295621.2466679448 | \n", "5687318.493374016 | \n", "
| PSMD4_HUMAN | \n", "P55036 | \n", "na | \n", "sp | \n", "False | \n", "113087994.44403341 | \n", "115011156.7335174 | \n", "
| SPD2B_HUMAN | \n", "A1X283 | \n", "na | \n", "sp | \n", "False | \n", "13801771.733223092 | \n", "10475164.42857083 | \n", "
| ... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
| ZN638_HUMAN | \n", "Q14966 | \n", "na | \n", "sp | \n", "False | \n", "\n", " | 1139843.6453892316 | \n", "
| RHOG_HUMAN | \n", "P84095 | \n", "na | \n", "sp | \n", "False | \n", "\n", " | 946679.6466570131 | \n", "
| ATP5S_HUMAN | \n", "Q99766 | \n", "na | \n", "sp | \n", "False | \n", "\n", " | 357778.52002529387 | \n", "
| TIM23_HUMAN;TI23B_HUMAN | \n", "O14925;Q5SRD1 | \n", "na;na | \n", "sp;sp | \n", "False | \n", "\n", " | 923799.3856913601 | \n", "
| CCNH_HUMAN | \n", "P51946 | \n", "na | \n", "sp | \n", "False | \n", "\n", " | 927884.4020782198 | \n", "
3781 rows × 2 columns
\n", "| \n", " | \n", " | \n", " | \n", " | \n", " | A_LFQ | \n", "B_LFQ | \n", "
|---|---|---|---|---|---|---|
| proteins | \n", "uniprot_ids | \n", "ensembl_ids | \n", "source_db | \n", "is_decoy | \n", "\n", " | \n", " |
| RL4_HUMAN | \n", "P36578 | \n", "na | \n", "sp | \n", "False | \n", "466932936.27537036 | \n", "484408315.44570005 | \n", "
| RCC2_HUMAN | \n", "Q9P258 | \n", "na | \n", "sp | \n", "False | \n", "407484183.9302226 | \n", "413813180.5879775 | \n", "
| RNBP6_HUMAN | \n", "O60518 | \n", "na | \n", "sp | \n", "False | \n", "4960386.374516514 | \n", "2022553.3655254466 | \n", "
| PSMD4_HUMAN | \n", "P55036 | \n", "na | \n", "sp | \n", "False | \n", "115742020.94987468 | \n", "112357130.22767611 | \n", "
| SPD2B_HUMAN | \n", "A1X283 | \n", "na | \n", "sp | \n", "False | \n", "12471120.728621317 | \n", "11805815.433172602 | \n", "
| ... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
| ZN638_HUMAN | \n", "Q14966 | \n", "na | \n", "sp | \n", "False | \n", "\n", " | 1139843.6453892316 | \n", "
| RHOG_HUMAN | \n", "P84095 | \n", "na | \n", "sp | \n", "False | \n", "\n", " | 946679.6466570131 | \n", "
| ATP5S_HUMAN | \n", "Q99766 | \n", "na | \n", "sp | \n", "False | \n", "\n", " | 357778.52002529387 | \n", "
| TIM23_HUMAN;TI23B_HUMAN | \n", "O14925;Q5SRD1 | \n", "na;na | \n", "sp;sp | \n", "False | \n", "\n", " | 923799.3856913601 | \n", "
| CCNH_HUMAN | \n", "P51946 | \n", "na | \n", "sp | \n", "False | \n", "\n", " | 927884.4020782198 | \n", "
3781 rows × 2 columns
\n", "| \n", " | PG.Genes | \n", "PG.Organisms | \n", "PG.ProteinNames | \n", "PTM.CollapseKey | \n", "PTM.FlankingRegion | \n", "PTM.ModificationTitle | \n", "PTM.Multiplicity | \n", "PTM.ProteinId | \n", "PTM.SiteAA | \n", "PTM.SiteLocation | \n", "... | \n", "[27] 20180816_QE3_nLC3_AH_DIA_H100_Y25_03.raw.PTM.Quantity | \n", "[28] 20180816_QE3_nLC3_AH_DIA_H100_Y25_04.raw.PTM.Quantity | \n", "[29] 20180816_QE3_nLC3_AH_DIA_H100_Y25_05.raw.PTM.Quantity | \n", "[30] 20180816_QE3_nLC3_AH_DIA_H100_Y25_06.raw.PTM.Quantity | \n", "[31] 20180816_QE3_nLC3_AH_DIA_H100_Y50_01.raw.PTM.Quantity | \n", "[32] 20180816_QE3_nLC3_AH_DIA_H100_Y50_02.raw.PTM.Quantity | \n", "[33] 20180816_QE3_nLC3_AH_DIA_H100_Y50_03.raw.PTM.Quantity | \n", "[34] 20180816_QE3_nLC3_AH_DIA_H100_Y50_04.raw.PTM.Quantity | \n", "[35] 20180816_QE3_nLC3_AH_DIA_H100_Y50_05.raw.PTM.Quantity | \n", "[36] 20180816_QE3_nLC3_AH_DIA_H100_Y50_06.raw.PTM.Quantity | \n", "
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | \n", "TRBV19;TRB | \n", "Homo sapiens | \n", "TVB19_HUMAN;TRBR1_HUMAN | \n", "A0A075B6N1_S86_M3 | \n", "IAEGYSVSREKKESF | \n", "Phospho (STY) | \n", "3 | \n", "A0A075B6N1 | \n", "S | \n", "86 | \n", "... | \n", "69968.8359375 | \n", "103632.6015625 | \n", "90488.9296875 | \n", "113429.859375 | \n", "96970.2734375 | \n", "61069.171875 | \n", "99673.2734375 | \n", "109199.875 | \n", "112307.4765625 | \n", "112374.84375 | \n", "
| 1 | \n", "TRBV19;TRB | \n", "Homo sapiens | \n", "TVB19_HUMAN;TRBR1_HUMAN | \n", "A0A075B6N1_S84_M3 | \n", "GDIAEGYSVSREKKE | \n", "Phospho (STY) | \n", "3 | \n", "A0A075B6N1 | \n", "S | \n", "84 | \n", "... | \n", "69968.8359375 | \n", "103632.6015625 | \n", "90488.9296875 | \n", "113429.859375 | \n", "96970.2734375 | \n", "61069.171875 | \n", "99673.2734375 | \n", "109199.875 | \n", "112307.4765625 | \n", "112374.84375 | \n", "
| 2 | \n", "TRBV19;TRB | \n", "Homo sapiens | \n", "TVB19_HUMAN;TRBR1_HUMAN | \n", "A0A075B6N1_Y83_M3 | \n", "KGDIAEGYSVSREKK | \n", "Phospho (STY) | \n", "3 | \n", "A0A075B6N1 | \n", "Y | \n", "83 | \n", "... | \n", "69968.8359375 | \n", "103632.6015625 | \n", "90488.9296875 | \n", "113429.859375 | \n", "96970.2734375 | \n", "61069.171875 | \n", "99673.2734375 | \n", "109199.875 | \n", "112307.4765625 | \n", "112374.84375 | \n", "
| 3 | \n", "TRBV19;TRB | \n", "Homo sapiens | \n", "TVB19_HUMAN;TRBR1_HUMAN | \n", "P0DSE2_S86_M3 | \n", "IAEGYSVSREKKESF | \n", "Phospho (STY) | \n", "3 | \n", "P0DSE2 | \n", "S | \n", "86 | \n", "... | \n", "69968.8359375 | \n", "103632.6015625 | \n", "90488.9296875 | \n", "113429.859375 | \n", "96970.2734375 | \n", "61069.171875 | \n", "99673.2734375 | \n", "109199.875 | \n", "112307.4765625 | \n", "112374.84375 | \n", "
| 4 | \n", "TRBV19;TRB | \n", "Homo sapiens | \n", "TVB19_HUMAN;TRBR1_HUMAN | \n", "P0DSE2_S84_M3 | \n", "GDIAEGYSVSREKKE | \n", "Phospho (STY) | \n", "3 | \n", "P0DSE2 | \n", "S | \n", "84 | \n", "... | \n", "69968.8359375 | \n", "103632.6015625 | \n", "90488.9296875 | \n", "113429.859375 | \n", "96970.2734375 | \n", "61069.171875 | \n", "99673.2734375 | \n", "109199.875 | \n", "112307.4765625 | \n", "112374.84375 | \n", "
| ... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
| 54858 | \n", "MORC2 | \n", "Homo sapiens | \n", "MORC2_HUMAN | \n", "Q9Y6X9_S739_M2 | \n", "ATPSRKRSVAVSDEE | \n", "Phospho (STY) | \n", "2 | \n", "Q9Y6X9 | \n", "S | \n", "739 | \n", "... | \n", "23552.466796875 | \n", "22144.580078125 | \n", "20846.8515625 | \n", "24248.41796875 | \n", "22490.0546875 | \n", "22095.990234375 | \n", "25553.849609375 | \n", "22250.546875 | \n", "14592.869140625 | \n", "19265.998046875 | \n", "
| 54859 | \n", "MORC2 | \n", "Homo sapiens | \n", "MORC2_HUMAN | \n", "Q9Y6X9-2_S681_M2 | \n", "RKRSVAVSDEEEVEE | \n", "Phospho (STY) | \n", "2 | \n", "Q9Y6X9-2 | \n", "S | \n", "681 | \n", "... | \n", "23552.466796875 | \n", "22144.580078125 | \n", "20846.8515625 | \n", "24248.41796875 | \n", "22490.0546875 | \n", "22095.990234375 | \n", "25553.849609375 | \n", "22250.546875 | \n", "14592.869140625 | \n", "19265.998046875 | \n", "
| 54860 | \n", "MORC2 | \n", "Homo sapiens | \n", "MORC2_HUMAN | \n", "Q9Y6X9-2_S677_M2 | \n", "ATPSRKRSVAVSDEE | \n", "Phospho (STY) | \n", "2 | \n", "Q9Y6X9-2 | \n", "S | \n", "677 | \n", "... | \n", "23552.466796875 | \n", "22144.580078125 | \n", "20846.8515625 | \n", "24248.41796875 | \n", "22490.0546875 | \n", "22095.990234375 | \n", "25553.849609375 | \n", "22250.546875 | \n", "14592.869140625 | \n", "19265.998046875 | \n", "
| 54861 | \n", "IVNS1ABP | \n", "Homo sapiens | \n", "NS1BP_HUMAN | \n", "Q9Y6Y0_M341_M1 | \n", "SKSLSFEMQQDELIE | \n", "Oxidation (M) | \n", "1 | \n", "Q9Y6Y0 | \n", "M | \n", "341 | \n", "... | \n", "Filtered | \n", "17287.40625 | \n", "Filtered | \n", "15751.861328125 | \n", "14749.724609375 | \n", "12410.79296875 | \n", "14130.1396484375 | \n", "Filtered | \n", "13198.474609375 | \n", "13553.0908203125 | \n", "
| 54862 | \n", "IVNS1ABP | \n", "Homo sapiens | \n", "NS1BP_HUMAN | \n", "Q9Y6Y0_S338_M1 | \n", "PKLSKSLSFEMQQDE | \n", "Phospho (STY) | \n", "1 | \n", "Q9Y6Y0 | \n", "S | \n", "338 | \n", "... | \n", "Filtered | \n", "17287.40625 | \n", "Filtered | \n", "15751.861328125 | \n", "14749.724609375 | \n", "12410.79296875 | \n", "14130.1396484375 | \n", "7562.62060546875 | \n", "13198.474609375 | \n", "13553.0908203125 | \n", "
54863 rows × 46 columns
\n", "" ], "text/plain": [ " PG.Genes PG.Organisms PG.ProteinNames PTM.CollapseKey \\\n", "0 TRBV19;TRB Homo sapiens TVB19_HUMAN;TRBR1_HUMAN A0A075B6N1_S86_M3 \n", "1 TRBV19;TRB Homo sapiens TVB19_HUMAN;TRBR1_HUMAN A0A075B6N1_S84_M3 \n", "2 TRBV19;TRB Homo sapiens TVB19_HUMAN;TRBR1_HUMAN A0A075B6N1_Y83_M3 \n", "3 TRBV19;TRB Homo sapiens TVB19_HUMAN;TRBR1_HUMAN P0DSE2_S86_M3 \n", "4 TRBV19;TRB Homo sapiens TVB19_HUMAN;TRBR1_HUMAN P0DSE2_S84_M3 \n", "... ... ... ... ... \n", "54858 MORC2 Homo sapiens MORC2_HUMAN Q9Y6X9_S739_M2 \n", "54859 MORC2 Homo sapiens MORC2_HUMAN Q9Y6X9-2_S681_M2 \n", "54860 MORC2 Homo sapiens MORC2_HUMAN Q9Y6X9-2_S677_M2 \n", "54861 IVNS1ABP Homo sapiens NS1BP_HUMAN Q9Y6Y0_M341_M1 \n", "54862 IVNS1ABP Homo sapiens NS1BP_HUMAN Q9Y6Y0_S338_M1 \n", "\n", " PTM.FlankingRegion PTM.ModificationTitle PTM.Multiplicity \\\n", "0 IAEGYSVSREKKESF Phospho (STY) 3 \n", "1 GDIAEGYSVSREKKE Phospho (STY) 3 \n", "2 KGDIAEGYSVSREKK Phospho (STY) 3 \n", "3 IAEGYSVSREKKESF Phospho (STY) 3 \n", "4 GDIAEGYSVSREKKE Phospho (STY) 3 \n", "... ... ... ... \n", "54858 ATPSRKRSVAVSDEE Phospho (STY) 2 \n", "54859 RKRSVAVSDEEEVEE Phospho (STY) 2 \n", "54860 ATPSRKRSVAVSDEE Phospho (STY) 2 \n", "54861 SKSLSFEMQQDELIE Oxidation (M) 1 \n", "54862 PKLSKSLSFEMQQDE Phospho (STY) 1 \n", "\n", " PTM.ProteinId PTM.SiteAA PTM.SiteLocation ... \\\n", "0 A0A075B6N1 S 86 ... \n", "1 A0A075B6N1 S 84 ... \n", "2 A0A075B6N1 Y 83 ... \n", "3 P0DSE2 S 86 ... \n", "4 P0DSE2 S 84 ... \n", "... ... ... ... ... \n", "54858 Q9Y6X9 S 739 ... \n", "54859 Q9Y6X9-2 S 681 ... \n", "54860 Q9Y6X9-2 S 677 ... \n", "54861 Q9Y6Y0 M 341 ... \n", "54862 Q9Y6Y0 S 338 ... \n", "\n", " [27] 20180816_QE3_nLC3_AH_DIA_H100_Y25_03.raw.PTM.Quantity \\\n", "0 69968.8359375 \n", "1 69968.8359375 \n", "2 69968.8359375 \n", "3 69968.8359375 \n", "4 69968.8359375 \n", "... ... \n", "54858 23552.466796875 \n", "54859 23552.466796875 \n", "54860 23552.466796875 \n", "54861 Filtered \n", "54862 Filtered \n", "\n", " [28] 20180816_QE3_nLC3_AH_DIA_H100_Y25_04.raw.PTM.Quantity \\\n", "0 103632.6015625 \n", "1 103632.6015625 \n", "2 103632.6015625 \n", "3 103632.6015625 \n", "4 103632.6015625 \n", "... ... \n", "54858 22144.580078125 \n", "54859 22144.580078125 \n", "54860 22144.580078125 \n", "54861 17287.40625 \n", "54862 17287.40625 \n", "\n", " [29] 20180816_QE3_nLC3_AH_DIA_H100_Y25_05.raw.PTM.Quantity \\\n", "0 90488.9296875 \n", "1 90488.9296875 \n", "2 90488.9296875 \n", "3 90488.9296875 \n", "4 90488.9296875 \n", "... ... \n", "54858 20846.8515625 \n", "54859 20846.8515625 \n", "54860 20846.8515625 \n", "54861 Filtered \n", "54862 Filtered \n", "\n", " [30] 20180816_QE3_nLC3_AH_DIA_H100_Y25_06.raw.PTM.Quantity \\\n", "0 113429.859375 \n", "1 113429.859375 \n", "2 113429.859375 \n", "3 113429.859375 \n", "4 113429.859375 \n", "... ... \n", "54858 24248.41796875 \n", "54859 24248.41796875 \n", "54860 24248.41796875 \n", "54861 15751.861328125 \n", "54862 15751.861328125 \n", "\n", " [31] 20180816_QE3_nLC3_AH_DIA_H100_Y50_01.raw.PTM.Quantity \\\n", "0 96970.2734375 \n", "1 96970.2734375 \n", "2 96970.2734375 \n", "3 96970.2734375 \n", "4 96970.2734375 \n", "... ... \n", "54858 22490.0546875 \n", "54859 22490.0546875 \n", "54860 22490.0546875 \n", "54861 14749.724609375 \n", "54862 14749.724609375 \n", "\n", " [32] 20180816_QE3_nLC3_AH_DIA_H100_Y50_02.raw.PTM.Quantity \\\n", "0 61069.171875 \n", "1 61069.171875 \n", "2 61069.171875 \n", "3 61069.171875 \n", "4 61069.171875 \n", "... ... \n", "54858 22095.990234375 \n", "54859 22095.990234375 \n", "54860 22095.990234375 \n", "54861 12410.79296875 \n", "54862 12410.79296875 \n", "\n", " [33] 20180816_QE3_nLC3_AH_DIA_H100_Y50_03.raw.PTM.Quantity \\\n", "0 99673.2734375 \n", "1 99673.2734375 \n", "2 99673.2734375 \n", "3 99673.2734375 \n", "4 99673.2734375 \n", "... ... \n", "54858 25553.849609375 \n", "54859 25553.849609375 \n", "54860 25553.849609375 \n", "54861 14130.1396484375 \n", "54862 14130.1396484375 \n", "\n", " [34] 20180816_QE3_nLC3_AH_DIA_H100_Y50_04.raw.PTM.Quantity \\\n", "0 109199.875 \n", "1 109199.875 \n", "2 109199.875 \n", "3 109199.875 \n", "4 109199.875 \n", "... ... \n", "54858 22250.546875 \n", "54859 22250.546875 \n", "54860 22250.546875 \n", "54861 Filtered \n", "54862 7562.62060546875 \n", "\n", " [35] 20180816_QE3_nLC3_AH_DIA_H100_Y50_05.raw.PTM.Quantity \\\n", "0 112307.4765625 \n", "1 112307.4765625 \n", "2 112307.4765625 \n", "3 112307.4765625 \n", "4 112307.4765625 \n", "... ... \n", "54858 14592.869140625 \n", "54859 14592.869140625 \n", "54860 14592.869140625 \n", "54861 13198.474609375 \n", "54862 13198.474609375 \n", "\n", " [36] 20180816_QE3_nLC3_AH_DIA_H100_Y50_06.raw.PTM.Quantity \n", "0 112374.84375 \n", "1 112374.84375 \n", "2 112374.84375 \n", "3 112374.84375 \n", "4 112374.84375 \n", "... ... \n", "54858 19265.998046875 \n", "54859 19265.998046875 \n", "54860 19265.998046875 \n", "54861 13553.0908203125 \n", "54862 13553.0908203125 \n", "\n", "[54863 rows x 46 columns]" ] }, "execution_count": 11, "metadata": {}, "output_type": "execute_result" } ], "source": [ "spectronaut_example_path = get_pg_matrix_example(search_engine=\"spectronaut\")\n", "\n", "# Parse with pandas for visualization purposes\n", "pd.read_csv(spectronaut_example_path, sep=\"\\t\")" ] }, { "cell_type": "markdown", "id": "e3c63473", "metadata": {}, "source": [ "The default reader extracts some streamlined information" ] }, { "cell_type": "code", "execution_count": 12, "id": "53449c57", "metadata": {}, "outputs": [ { "data": { "text/html": [ "| \n", " | \n", " | [1] 20180815_QE3_nLC3_AH_DIA_Honly_ind_01.raw.PTM.Quantity | \n", "[2] 20180815_QE3_nLC3_AH_DIA_Honly_ind_02.raw.PTM.Quantity | \n", "[3] 20180815_QE3_nLC3_AH_DIA_Honly_ind_03.raw.PTM.Quantity | \n", "[4] 20180815_QE3_nLC3_AH_DIA_Yonly_ind_01.raw.PTM.Quantity | \n", "[5] 20180815_QE3_nLC3_AH_DIA_Yonly_ind_02.raw.PTM.Quantity | \n", "[6] 20180815_QE3_nLC3_AH_DIA_Yonly_ind_03.raw.PTM.Quantity | \n", "[7] 20180816_QE3_nLC3_AH_DIA_H100_Y100_01.raw.PTM.Quantity | \n", "[8] 20180816_QE3_nLC3_AH_DIA_H100_Y100_02.raw.PTM.Quantity | \n", "[9] 20180816_QE3_nLC3_AH_DIA_H100_Y100_03.raw.PTM.Quantity | \n", "[10] 20180816_QE3_nLC3_AH_DIA_H100_Y100_04.raw.PTM.Quantity | \n", "... | \n", "[27] 20180816_QE3_nLC3_AH_DIA_H100_Y25_03.raw.PTM.Quantity | \n", "[28] 20180816_QE3_nLC3_AH_DIA_H100_Y25_04.raw.PTM.Quantity | \n", "[29] 20180816_QE3_nLC3_AH_DIA_H100_Y25_05.raw.PTM.Quantity | \n", "[30] 20180816_QE3_nLC3_AH_DIA_H100_Y25_06.raw.PTM.Quantity | \n", "[31] 20180816_QE3_nLC3_AH_DIA_H100_Y50_01.raw.PTM.Quantity | \n", "[32] 20180816_QE3_nLC3_AH_DIA_H100_Y50_02.raw.PTM.Quantity | \n", "[33] 20180816_QE3_nLC3_AH_DIA_H100_Y50_03.raw.PTM.Quantity | \n", "[34] 20180816_QE3_nLC3_AH_DIA_H100_Y50_04.raw.PTM.Quantity | \n", "[35] 20180816_QE3_nLC3_AH_DIA_H100_Y50_05.raw.PTM.Quantity | \n", "[36] 20180816_QE3_nLC3_AH_DIA_H100_Y50_06.raw.PTM.Quantity | \n", "
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| proteins | \n", "genes | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
| TVB19_HUMAN;TRBR1_HUMAN | \n", "TRBV19;TRB | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "89374.656250 | \n", "NaN | \n", "90181.578125 | \n", "96197.070312 | \n", "... | \n", "69968.835938 | \n", "103632.601562 | \n", "90488.929688 | \n", "113429.859375 | \n", "96970.273438 | \n", "61069.171875 | \n", "99673.273438 | \n", "109199.875000 | \n", "112307.476562 | \n", "112374.843750 | \n", "
| TRBV19;TRB | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "89374.656250 | \n", "NaN | \n", "90181.578125 | \n", "96197.070312 | \n", "... | \n", "69968.835938 | \n", "103632.601562 | \n", "90488.929688 | \n", "113429.859375 | \n", "96970.273438 | \n", "61069.171875 | \n", "99673.273438 | \n", "109199.875000 | \n", "112307.476562 | \n", "112374.843750 | \n", "|
| TRBV19;TRB | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "89374.656250 | \n", "NaN | \n", "90181.578125 | \n", "96197.070312 | \n", "... | \n", "69968.835938 | \n", "103632.601562 | \n", "90488.929688 | \n", "113429.859375 | \n", "96970.273438 | \n", "61069.171875 | \n", "99673.273438 | \n", "109199.875000 | \n", "112307.476562 | \n", "112374.843750 | \n", "|
| TRBV19;TRB | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "89374.656250 | \n", "NaN | \n", "90181.578125 | \n", "96197.070312 | \n", "... | \n", "69968.835938 | \n", "103632.601562 | \n", "90488.929688 | \n", "113429.859375 | \n", "96970.273438 | \n", "61069.171875 | \n", "99673.273438 | \n", "109199.875000 | \n", "112307.476562 | \n", "112374.843750 | \n", "|
| TRBV19;TRB | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "89374.656250 | \n", "NaN | \n", "90181.578125 | \n", "96197.070312 | \n", "... | \n", "69968.835938 | \n", "103632.601562 | \n", "90488.929688 | \n", "113429.859375 | \n", "96970.273438 | \n", "61069.171875 | \n", "99673.273438 | \n", "109199.875000 | \n", "112307.476562 | \n", "112374.843750 | \n", "|
| ... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
| MORC2_HUMAN | \n", "MORC2 | \n", "NaN | \n", "NaN | \n", "6817.745605 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "18010.679688 | \n", "12501.521484 | \n", "17377.408203 | \n", "13730.358398 | \n", "... | \n", "23552.466797 | \n", "22144.580078 | \n", "20846.851562 | \n", "24248.417969 | \n", "22490.054688 | \n", "22095.990234 | \n", "25553.849609 | \n", "22250.546875 | \n", "14592.869141 | \n", "19265.998047 | \n", "
| MORC2 | \n", "NaN | \n", "NaN | \n", "6817.745605 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "18010.679688 | \n", "12501.521484 | \n", "17377.408203 | \n", "13730.358398 | \n", "... | \n", "23552.466797 | \n", "22144.580078 | \n", "20846.851562 | \n", "24248.417969 | \n", "22490.054688 | \n", "22095.990234 | \n", "25553.849609 | \n", "22250.546875 | \n", "14592.869141 | \n", "19265.998047 | \n", "|
| MORC2 | \n", "NaN | \n", "NaN | \n", "6817.745605 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "18010.679688 | \n", "12501.521484 | \n", "17377.408203 | \n", "13730.358398 | \n", "... | \n", "23552.466797 | \n", "22144.580078 | \n", "20846.851562 | \n", "24248.417969 | \n", "22490.054688 | \n", "22095.990234 | \n", "25553.849609 | \n", "22250.546875 | \n", "14592.869141 | \n", "19265.998047 | \n", "|
| NS1BP_HUMAN | \n", "IVNS1ABP | \n", "NaN | \n", "NaN | \n", "38411.285156 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "10104.601562 | \n", "12773.764648 | \n", "10412.311523 | \n", "11411.670898 | \n", "... | \n", "NaN | \n", "17287.406250 | \n", "NaN | \n", "15751.861328 | \n", "14749.724609 | \n", "12410.792969 | \n", "14130.139648 | \n", "NaN | \n", "13198.474609 | \n", "13553.090820 | \n", "
| IVNS1ABP | \n", "NaN | \n", "NaN | \n", "38411.285156 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "10104.601562 | \n", "18788.167969 | \n", "10412.311523 | \n", "17367.800781 | \n", "... | \n", "NaN | \n", "17287.406250 | \n", "NaN | \n", "15751.861328 | \n", "14749.724609 | \n", "12410.792969 | \n", "14130.139648 | \n", "7562.620605 | \n", "13198.474609 | \n", "13553.090820 | \n", "
54863 rows × 36 columns
\n", "| \n", " | \n", " | \n", " | [1] 20180815_QE3_nLC3_AH_DIA_Honly_ind_01.raw.PTM.Quantity | \n", "[2] 20180815_QE3_nLC3_AH_DIA_Honly_ind_02.raw.PTM.Quantity | \n", "[3] 20180815_QE3_nLC3_AH_DIA_Honly_ind_03.raw.PTM.Quantity | \n", "[4] 20180815_QE3_nLC3_AH_DIA_Yonly_ind_01.raw.PTM.Quantity | \n", "[5] 20180815_QE3_nLC3_AH_DIA_Yonly_ind_02.raw.PTM.Quantity | \n", "[6] 20180815_QE3_nLC3_AH_DIA_Yonly_ind_03.raw.PTM.Quantity | \n", "[7] 20180816_QE3_nLC3_AH_DIA_H100_Y100_01.raw.PTM.Quantity | \n", "[8] 20180816_QE3_nLC3_AH_DIA_H100_Y100_02.raw.PTM.Quantity | \n", "[9] 20180816_QE3_nLC3_AH_DIA_H100_Y100_03.raw.PTM.Quantity | \n", "[10] 20180816_QE3_nLC3_AH_DIA_H100_Y100_04.raw.PTM.Quantity | \n", "... | \n", "[27] 20180816_QE3_nLC3_AH_DIA_H100_Y25_03.raw.PTM.Quantity | \n", "[28] 20180816_QE3_nLC3_AH_DIA_H100_Y25_04.raw.PTM.Quantity | \n", "[29] 20180816_QE3_nLC3_AH_DIA_H100_Y25_05.raw.PTM.Quantity | \n", "[30] 20180816_QE3_nLC3_AH_DIA_H100_Y25_06.raw.PTM.Quantity | \n", "[31] 20180816_QE3_nLC3_AH_DIA_H100_Y50_01.raw.PTM.Quantity | \n", "[32] 20180816_QE3_nLC3_AH_DIA_H100_Y50_02.raw.PTM.Quantity | \n", "[33] 20180816_QE3_nLC3_AH_DIA_H100_Y50_03.raw.PTM.Quantity | \n", "[34] 20180816_QE3_nLC3_AH_DIA_H100_Y50_04.raw.PTM.Quantity | \n", "[35] 20180816_QE3_nLC3_AH_DIA_H100_Y50_05.raw.PTM.Quantity | \n", "[36] 20180816_QE3_nLC3_AH_DIA_H100_Y50_06.raw.PTM.Quantity | \n", "
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| proteins | \n", "genes | \n", "ptm_site_amino_acid | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
| TVB19_HUMAN;TRBR1_HUMAN | \n", "TRBV19;TRB | \n", "S | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "89374.656250 | \n", "NaN | \n", "90181.578125 | \n", "96197.070312 | \n", "... | \n", "69968.835938 | \n", "103632.601562 | \n", "90488.929688 | \n", "113429.859375 | \n", "96970.273438 | \n", "61069.171875 | \n", "99673.273438 | \n", "109199.875000 | \n", "112307.476562 | \n", "112374.843750 | \n", "
| S | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "89374.656250 | \n", "NaN | \n", "90181.578125 | \n", "96197.070312 | \n", "... | \n", "69968.835938 | \n", "103632.601562 | \n", "90488.929688 | \n", "113429.859375 | \n", "96970.273438 | \n", "61069.171875 | \n", "99673.273438 | \n", "109199.875000 | \n", "112307.476562 | \n", "112374.843750 | \n", "||
| Y | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "89374.656250 | \n", "NaN | \n", "90181.578125 | \n", "96197.070312 | \n", "... | \n", "69968.835938 | \n", "103632.601562 | \n", "90488.929688 | \n", "113429.859375 | \n", "96970.273438 | \n", "61069.171875 | \n", "99673.273438 | \n", "109199.875000 | \n", "112307.476562 | \n", "112374.843750 | \n", "||
| S | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "89374.656250 | \n", "NaN | \n", "90181.578125 | \n", "96197.070312 | \n", "... | \n", "69968.835938 | \n", "103632.601562 | \n", "90488.929688 | \n", "113429.859375 | \n", "96970.273438 | \n", "61069.171875 | \n", "99673.273438 | \n", "109199.875000 | \n", "112307.476562 | \n", "112374.843750 | \n", "||
| S | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "89374.656250 | \n", "NaN | \n", "90181.578125 | \n", "96197.070312 | \n", "... | \n", "69968.835938 | \n", "103632.601562 | \n", "90488.929688 | \n", "113429.859375 | \n", "96970.273438 | \n", "61069.171875 | \n", "99673.273438 | \n", "109199.875000 | \n", "112307.476562 | \n", "112374.843750 | \n", "||
| ... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
| MORC2_HUMAN | \n", "MORC2 | \n", "S | \n", "NaN | \n", "NaN | \n", "6817.745605 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "18010.679688 | \n", "12501.521484 | \n", "17377.408203 | \n", "13730.358398 | \n", "... | \n", "23552.466797 | \n", "22144.580078 | \n", "20846.851562 | \n", "24248.417969 | \n", "22490.054688 | \n", "22095.990234 | \n", "25553.849609 | \n", "22250.546875 | \n", "14592.869141 | \n", "19265.998047 | \n", "
| S | \n", "NaN | \n", "NaN | \n", "6817.745605 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "18010.679688 | \n", "12501.521484 | \n", "17377.408203 | \n", "13730.358398 | \n", "... | \n", "23552.466797 | \n", "22144.580078 | \n", "20846.851562 | \n", "24248.417969 | \n", "22490.054688 | \n", "22095.990234 | \n", "25553.849609 | \n", "22250.546875 | \n", "14592.869141 | \n", "19265.998047 | \n", "||
| S | \n", "NaN | \n", "NaN | \n", "6817.745605 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "18010.679688 | \n", "12501.521484 | \n", "17377.408203 | \n", "13730.358398 | \n", "... | \n", "23552.466797 | \n", "22144.580078 | \n", "20846.851562 | \n", "24248.417969 | \n", "22490.054688 | \n", "22095.990234 | \n", "25553.849609 | \n", "22250.546875 | \n", "14592.869141 | \n", "19265.998047 | \n", "||
| NS1BP_HUMAN | \n", "IVNS1ABP | \n", "M | \n", "NaN | \n", "NaN | \n", "38411.285156 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "10104.601562 | \n", "12773.764648 | \n", "10412.311523 | \n", "11411.670898 | \n", "... | \n", "NaN | \n", "17287.406250 | \n", "NaN | \n", "15751.861328 | \n", "14749.724609 | \n", "12410.792969 | \n", "14130.139648 | \n", "NaN | \n", "13198.474609 | \n", "13553.090820 | \n", "
| S | \n", "NaN | \n", "NaN | \n", "38411.285156 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "10104.601562 | \n", "18788.167969 | \n", "10412.311523 | \n", "17367.800781 | \n", "... | \n", "NaN | \n", "17287.406250 | \n", "NaN | \n", "15751.861328 | \n", "14749.724609 | \n", "12410.792969 | \n", "14130.139648 | \n", "7562.620605 | \n", "13198.474609 | \n", "13553.090820 | \n", "
54863 rows × 36 columns
\n", "