Monarch geneset OGS2.0

DPOGS209867
TranscriptDPOGS209867-TA1311 bp
ProteinDPOGS209867-PA436 aa
Genomic positionDPSCF300302 - 128266-150732
RNAseq coverage1114x (Rank: top 11%)
Annotation
HeliconiusHMEL0075366e-4061.29% 
BombyxBGIBMGA004421-TA9e-7551.32% 
DrosophilaCG3074-PB8e-9643.86% 
EBI UniRef50UniRef50_A0MA794e-11349.77%TIN-ag-RP n=1 Tax=Bombyx mori RepID=A0MA79_BOMMO
NCBI RefSeqNP_001116812.17e-11449.77%tubulointerstitial nephritis antigen [Bombyx mori]
NCBI nr blastpgi|1825092021e-11249.77%tubulointerstitial nephritis antigen precursor [Bombyx mori]
NCBI nr blastxgi|1825092027e-12049.77%tubulointerstitial nephritis antigen precursor [Bombyx mori]
Group
Gene OntologyGO:00082341.1e-117cysteine-type peptidase activity
GO:00065081.4e-57proteolysis
KEGG pathwaybfo:BRAFLDRAFT_2472641e-37 
 K01363 (CTSB)maps-> Lysosome
    Antigen processing and presentation
InterPro domain[100-425] IPR0131281.1e-117Peptidase C1A, papain
[186-426] IPR0006681.4e-57Peptidase C1A, papain C-terminal
Orthology groupMCL13474 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209867-TA
ATGATGAATGTATTATATTCGCTGCTGTTCTGTGGGCTGGTGAGCGTAACCACGGCGTACTGGCGCCCGGGACTGCCGCCGGGGCCGTACTGCGGCATCAACAACCAGTGCTGCACCGATCGCAAGGATGACTGCTCACATCGGATACGAGATACTCTATGTTACTGCGATCAATTCTGTAACCGCACTCACGACGATTGCTGTCCTGATTACGAGGAAGTTTGCCTCGGGAAACCCTCGAACATTCTGGAGCCGTGCAAGCATAACGGCAAGTTGTATTTCAAGGGGGACAAGCGAATGGATAACTGTAATACATGTGAATGCGTCCAAGACCCCTACACCAACCAGCCTCAGTGGAGCTGTGAACGCGACGCGTGCATCATCAGTGATGACGTCATCTATGGTGTCAACAGAGGGAACAGCTGGAGGGCCTACAACTATACTCAGTTCTACGGAAAGAAGCTGAGAGACGGGATCATATATAAGCTAGGTACAATGCCATTGAGCCACGAAACAAGACGCATGGGTCCGATCAGATACGACAAGGATATACCGTATCCAAGGGATTTCGACGCTCGCCGTCGCTGGCCAAACTTCATCTCGCCGGTGTTAGATCAAGGATGGTGTGGCTCGGACTGGGCGGTCACCATAGCTACCGTCGCCTCTGATAGGTTCGCGATCCAGTCGAACGGCGCTGAGAGGATGGTGCTGTCCCCTCAGGTGCTTCTCTCTTGTAACATCAGACGTCAGCAGGGCTGTCGCGGCGGCCATATCGACGTAGCCTGGAACTTCGCCAGAGGCCACGGTCTCGTCGACGAGGAATGCTTTCCTTACAAAGCCGCGACTACCAGCTGTCCCTTCAGACCGAAAGCTAATCTCATAGGTATTAAAAATATACAATCCGTGTGTATAGAGGACGGTTGCCGGCCTCCGGTCCGCCAAAGAACCTCCCGCTACAAGGTGGGTCCTCCCGGGAAACTCGCCACAGAAAACGACATCATGTACGACATCATGGAGTCCGGGCCAGTCCACGCCGTAATGACGGTACACCAGGACTTTTTCCACTACCACGATGGTATCTACCGCCGTTCTCCGTACGGTGACAACACCCTTCAGGGCTTGCATAGCGTCAGGATCGTGGGTTGGGGAGAAGACAGAGGAGATAAATACTGGGTGGTTGCCAACAGCTGGGGCTGTGACTGGGGTGAGAACGGCTACTTCCGTATAGCGCGTGGCAGCAACGAGTCCGGCATCGAGTCGTTCGTGGTCACCGTCCTCAGTGACGTCACTGAGGCCTACCAAAAGAAATAA

Protein sequence:

>DPOGS209867-PA
MMNVLYSLLFCGLVSVTTAYWRPGLPPGPYCGINNQCCTDRKDDCSHRIRDTLCYCDQFCNRTHDDCCPDYEEVCLGKPSNILEPCKHNGKLYFKGDKRMDNCNTCECVQDPYTNQPQWSCERDACIISDDVIYGVNRGNSWRAYNYTQFYGKKLRDGIIYKLGTMPLSHETRRMGPIRYDKDIPYPRDFDARRRWPNFISPVLDQGWCGSDWAVTIATVASDRFAIQSNGAERMVLSPQVLLSCNIRRQQGCRGGHIDVAWNFARGHGLVDEECFPYKAATTSCPFRPKANLIGIKNIQSVCIEDGCRPPVRQRTSRYKVGPPGKLATENDIMYDIMESGPVHAVMTVHQDFFHYHDGIYRRSPYGDNTLQGLHSVRIVGWGEDRGDKYWVVANSWGCDWGENGYFRIARGSNESGIESFVVTVLSDVTEAYQKK-