Tutorial_ProteoRE_Biomarkers_Candidate_Identification


Step	Annotation
Step 1: Build tissue-specific expression dataset
RNA levels based on RNA-seq experiments
/home/proteore/galaxy/tool-data/protein_atlas/HPA_RNA_tissue_17-06-2021.tsv
heart muscle
Step 2: Build tissue-specific expression dataset
Expression profiles based on immunohistochemistry
/home/proteore/galaxy/tool-data/protein_atlas/HPA_normal_tissue_17-06-2021.tsv
heart muscle
High, Medium
Enhanced, Supported
Step 3: Filter by keywords and/or numerical value
Output dataset 'output' from step 1
True
Discard
OR
Filter by keywords
Filter by numerical values
Filter by numerical value 1
c4
<=
10.0
Filter by range of numerical values
False
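The numerical filter in step 3 can be sketched as follows. This is a minimal illustration, not the ProteoRE implementation: it assumes that `True` means the input has a header line, that `c4` is the fourth column, and that rows matching `c4 <= 10.0` go to a discarded set while the rest become `kept_lines`. The function name, the sample rows and their column layout are hypothetical.

```python
def discard_numeric(rows, col_index, threshold, has_header=True):
    """Split rows into (kept, discarded); a row is discarded when its value <= threshold."""
    header = rows[:1] if has_header else []
    body = rows[1:] if has_header else rows
    kept, discarded = [], []
    for row in body:
        try:
            value = float(row[col_index])
        except (ValueError, IndexError):
            # Assumption: cells that are not numbers never match the filter.
            kept.append(row)
            continue
        (discarded if value <= threshold else kept).append(row)
    return header + kept, header + discarded


# Hypothetical rows; the real layout of the step 1 output is not shown in the workflow.
rows = [
    ["Gene", "Gene name", "Tissue", "Value"],
    ["ENSG_A", "GENE_A", "heart muscle", "250.3"],
    ["ENSG_B", "GENE_B", "heart muscle", "10.0"],
    ["ENSG_C", "GENE_C", "heart muscle", "3.2"],
]
kept_lines, discarded_lines = discard_numeric(rows, col_index=3, threshold=10.0)
```

Because the operator is `<=`, a value of exactly 10.0 is discarded along with lower values.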
Step 4: Venn diagram
Lists to compare
List to compare 1
Input file containing your list
Output dataset 'kept_lines' from step 3
True
c1
Heart RNAseq
List to compare 2
Input file containing your list
Output dataset 'output' from step 2
True
c1
Heart IHC
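Conceptually, the comparison in step 4 is a set operation on the identifiers found in column `c1` of each list. The sketch below shows that idea only; the function name and the placeholder identifiers are hypothetical, and the actual tool output format is not reproduced here.

```python
def venn_two(list_a, list_b):
    """Return the three regions of a two-set Venn diagram as sorted lists."""
    a, b = set(list_a), set(list_b)
    return {
        "only_a": sorted(a - b),
        "both": sorted(a & b),
        "only_b": sorted(b - a),
    }


# Placeholder identifiers standing in for column c1 of each input list.
rnaseq_ids = ["ENSG_A", "ENSG_B", "ENSG_C"]   # "Heart RNAseq"
ihc_ids = ["ENSG_B", "ENSG_C", "ENSG_D"]      # "Heart IHC"
regions = venn_two(rnaseq_ids, ihc_ids)
```

The `both` region corresponds to candidates supported by both the RNA-seq and the immunohistochemistry evidence.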
Step 5: Cut
Output dataset 'output_text' from step 4
Keep
Tab
fields
3
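Step 5 keeps the third tab-separated field of the Venn text output. The sketch below assumes, without confirmation from the workflow itself, that this output has one column per Venn region and that shorter columns are padded with `NA`, which would explain why the next step discards `NA` values. The function name and sample lines are hypothetical.

```python
def cut_fields(lines, fields, delimiter="\t"):
    """Keep the 1-based fields listed in `fields` from each delimited line."""
    result = []
    for line in lines:
        cells = line.rstrip("\n").split(delimiter)
        kept = [cells[i - 1] for i in fields if 1 <= i <= len(cells)]
        result.append(delimiter.join(kept))
    return result


# Assumed layout: one column per region, padded with NA.
venn_text = [
    "Heart RNAseq\tHeart IHC\tHeart RNAseq & Heart IHC",
    "ENSG_A\tENSG_D\tENSG_B",
    "ENSG_E\tNA\tNA",
]
column_3 = cut_fields(venn_text, fields=[3])
```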
Step 6: Filter by keywords and/or numerical value
Output dataset 'output' from step 5
True
Discard
OR
Filter by keywords
Filter by keywords 1
c1
True
copy/paste
NA
Filter by numerical values
Filter by range of numerical values
False
Step 7: Add expression data
/home/proteore/galaxy/tool-data/protein_atlas/HPA_full_atlas_17-06-2021.tsv
Input file containing your IDs
Output dataset 'kept_lines' from step 6
c1
True
RNAseq/Ab-based expression data:
Gene name; Gene description; RNA tissue category; RNA non-specific tissue abundance in 'Transcript Per Million' (only for 23/10/2018 release)
Step 8: Filter by keywords and/or numerical value
Output dataset 'output' from step 7
True
Keep
OR
Filter by keywords
Filter by keywords 1
c4
False
copy/paste
enriched enhanced
Filter by numerical values
Filter by range of numerical values
False
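The keyword filter in step 8 can be approximated as below. This is a sketch under stated assumptions: the `False` after `c4` is read as "no exact match" (substring search), the two keywords are combined with OR, and matching is case-insensitive. None of this is confirmed by the workflow listing. The helper names and sample category values are illustrative only.

```python
def matches_keywords(cell, keywords, exact):
    """True when the cell equals (exact) or contains (not exact) any keyword."""
    cell_lower = cell.lower()
    if exact:
        return any(cell_lower == k.lower() for k in keywords)
    return any(k.lower() in cell_lower for k in keywords)


def keep_by_keywords(rows, col_index, keywords, exact=False, has_header=True):
    """Keep only the rows whose cell at col_index matches at least one keyword."""
    header = rows[:1] if has_header else []
    body = rows[1:] if has_header else rows
    return header + [r for r in body if matches_keywords(r[col_index], keywords, exact)]


# Illustrative rows; column 4 (index 3) stands in for the RNA tissue category.
rows = [
    ["ID", "Gene name", "Gene description", "RNA tissue category"],
    ["ENSG_A", "GENE_A", "desc A", "Tissue enriched"],
    ["ENSG_B", "GENE_B", "desc B", "Expressed in all"],
    ["ENSG_C", "GENE_C", "desc C", "Tissue enhanced"],
]
kept_lines = keep_by_keywords(rows, col_index=3, keywords=["enriched", "enhanced"])
```

With `exact=True` and the keyword `NA`, the same matching helper would describe the `NA` filters in steps 6 and 13, except that those steps discard the matching rows instead of keeping them.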
Step 9: ID Converter
Input file containing IDs
Output dataset 'kept_lines' from step 8
True
c1
Human (Homo sapiens)
/home/proteore/galaxy/tool-data/id_mapping/Human_id_mapping_16-06-2021.tsv
Ensembl gene ID (e.g. ENSG00000166913)
Target type:
UniProt accession number (e.g. P31946) - reviewed entries only
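At its core, the conversion in step 9 is a lookup from one identifier type to another. The sketch below uses a plain dictionary in place of the mapping file; the function name is hypothetical, and returning `NA` for identifiers without a mapping is an assumption. The single mapping pair reuses the two example identifiers shown in the tool labels.

```python
def convert_ids(ids, mapping, missing="NA"):
    """Pair each source ID with its target ID, or `missing` when none is known."""
    return [(source_id, mapping.get(source_id, missing)) for source_id in ids]


# Ensembl gene ID -> UniProt accession, using the examples from the tool labels.
mapping = {"ENSG00000166913": "P31946"}
converted = convert_ids(["ENSG00000166913", "ENSG_UNKNOWN"], mapping)
```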
Step 10: Add protein features
Input file containing your IDs
Output dataset 'output' from step 9
c5
True
UniProt accession number
Number of transmembrane domains; Subcellular Location; Disease information
tool-data/nextprot_ref_31-07-2020.tsv
Step 11: Filter by keywords and/or numerical value
Output dataset 'output' from step 10
True
Keep
AND
Filter by keywords
Filter by keywords 1
c7
False
copy/paste
cytoplasm cytosol
Filter by numerical values
Filter by numerical value 1
c6
=
0.0
Filter by range of numerical values
False
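Step 11 combines a keyword filter and a numerical filter with AND. A minimal sketch of that combination follows, assuming substring matching for the keywords, case-insensitive comparison, and that non-numeric cells fail the numerical test. The function name and the three-column sample rows are hypothetical; in the workflow the relevant columns are `c7` and `c6`.

```python
def keep_keyword_and_value(rows, loc_col, keywords, num_col, num_value, has_header=True):
    """Keep rows that contain a keyword in loc_col AND equal num_value in num_col."""
    header = rows[:1] if has_header else []
    body = rows[1:] if has_header else rows
    kept = []
    for row in body:
        location = row[loc_col].lower()
        keyword_hit = any(k.lower() in location for k in keywords)
        try:
            numeric_hit = float(row[num_col]) == num_value
        except ValueError:
            numeric_hit = False  # assumption: "NA" or text never equals the value
        if keyword_hit and numeric_hit:
            kept.append(row)
    return header + kept


# Hypothetical rows: ID, number of transmembrane domains, subcellular location.
rows = [
    ["UniProt", "TM domains", "Subcellular location"],
    ["P_A", "0", "Cytoplasm"],
    ["P_B", "2", "Cytoplasm; Cell membrane"],
    ["P_C", "0", "Nucleus"],
    ["P_D", "NA", "Cytosol"],
]
kept_lines = keep_keyword_and_value(
    rows, loc_col=2, keywords=["cytoplasm", "cytosol"], num_col=1, num_value=0.0
)
```

Under these assumptions only proteins annotated as cytoplasmic or cytosolic with no transmembrane domain are retained.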
Step 12: Get MS/MS observations in tissue/fluid
Input file containing your IDs
Output dataset 'kept_lines' from step 11
True
c5
tool-data/Human_Heart_2014-08.tsv; tool-data/Human_Plasma_non_glyco_2017-04.tsv
Step 13: Filter by keywords and/or numerical value
Output dataset 'output' from step 12
True
Discard
OR
Filter by keywords
Filter by keywords 1
c10
True
copy/paste
NA
Filter by numerical values
Filter by range of numerical values
False