Monarch geneset OGS2.0

DPOGS203768
Transcript          DPOGS203768-TA   2196 bp
Protein             DPOGS203768-PA   731 aa
Genomic position    DPSCF300010 + 526926-533880
RNAseq coverage     160x (Rank: top 52%)
Annotation
Heliconius          HMEL004231         0.0      66.91%
Bombyx              BGIBMGA013343-TA   0.0      70.26%
Drosophila          gho-PA             1e-125   50.95%
EBI UniRef50        UniRef50_Q9VQ94    2e-123   50.95%   CG10882 n=30 Tax=Coelomata RepID=Q9VQ94_DROME
NCBI RefSeq         XP_971886.1        2e-141   54.69%   PREDICTED: similar to Sec24B protein, putative [Tribolium castaneum]
NCBI nr blastp      gi|91094647        3e-140   54.69%   PREDICTED: similar to Sec24B protein, putative [Tribolium castaneum]
NCBI nr blastx      gi|91094647        2e-137   54.69%   PREDICTED: similar to Sec24B protein, putative [Tribolium castaneum]
Group
Gene Ontology       GO:0006886   4e-29     intracellular protein transport
                    GO:0030127   4e-29     COPII vesicle coat
                    GO:0006888   4e-29     ER to Golgi vesicle-mediated transport
                    GO:0008270   3.2e-13   zinc ion binding
KEGG pathway        tca:660571   5e-141
                    K14007 (SEC24)   maps-> Protein processing in endoplasmic reticulum
InterPro domain     [467-572]   IPR006900   4e-29     Sec23/Sec24, helical domain
                    [373-455]   IPR012990   5.3e-20   Sec23/Sec24 beta-sandwich
                    [282-366]   IPR006896   6.7e-17   Sec23/Sec24, trunk domain
                    [26-65]     IPR006895   3.2e-13   Zinc finger, Sec23/Sec24-type
Orthology group     MCL10886   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203768-TA
ATGGCAGAGACTATCGGCACGGAACAGGAGCCCCCCCTGCTGGACTTCGCAGCCCTGACGGGGTCCCCCACCATGGGCCCGGTGAGATGCTGTCGCTGCAAGGCCTACATGTGTCCCAACATGAAGTTCATAGACGGCGGCAGACACTTCAAGTGCGCCTTCTGCAAGGCCACCAGCGAAGTGCCGATGGAGTACACGCAGTACATCACCAGCATGCAGCAGTACGGTCGCGTGCCGCCCGAGATGGCGCTCGGCACTTACGAGATAGTCGCCACCAAGGAATACTGTCGGAACAACACCCTTCCTAATCCTCCAGCTATAGTGTTCGTGATCGACGTCTCATACAACTCTATAAAGAGCGGCCTGCTGCAAACTATATGTGACAATATACTGGAGATAATACAGACACCACTGACGGACGGTGAAGGAAAACATCGTGAGGAAACCTGGTCTTATAATTACAAATCGCCAATCCGTCTTGAACAAGCTTTGTTGTGCAAACGAAACAAGCAAGCAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAACAAAAGAAATCCCCTCAGAGCACGTCGTACAATGAGCTGGGGCAGCTGTGCAGCGCTGCAGGCGTGTGTGTCCAGATGTTCGTGTGTAACAACTCGTACGTCGACTGCGCCACCATCGGCCAGTTGGCGAGGCTCACCGGCGGACAAGTACACAAGTACACGTACTTCACTTCGGACACGGACGGCCACCGTCTGATGTGGGACGTGTCGCGCGTGCTGTCCCGCCCGACCGCTCACGACGCGGTCATGCGCGTGAGGACCAGCACCGGCGTGCGCCCCACAGACTTCTACGGACACTTCTTCATGTCCAACACCACGGACGTCGAGCTGGCCGCCATCGACTCGGACAAGGCCATCGGCGTGGAGATCAAACACGACGACAAGTTGACGGCGGAGTCCGGCGTGTACATCCAGGCGGCGCTGCTGTACACCCACCGCTCGGGCCAGCGGAGGCTGAGGGTCATCAACCTCGCGCTGTCGCTCGCCCACCAGCTGGCGGATGTGTACAGATCCGCGGAACTGGACACCATCGTAAACTTCCTCACTAAACAAGCCGTGTGGGCGCTCCGTGAGGCTACGCCCCGTCAGGTCCGCGAGGGCCTCACAAGTCGCTGTGCTCGTTCGCTGGCCGCCTACCGACGTCACTGCGCCTCGCCCTCGTCCGCCGGACAGCTGGTGTTGCCGGAAGCCATGAAACTACTTCCACTATACACCAGCTGTGTGCTGCGGTCTGATGCTGTCGGCGGTGGGCCGGACATAACGTGCGACGACCGCTCGTGCGCCATGTACCGCGCGCTCACGGCGGACGTGTCCCTGTCGCTCGTGTACACCTACCCTCGCCTGCTGCCGCTGCACGTGCTGCCCGACCAGGAGCCCGCCCCGCTCAGGGCCTCCATAGACAAGATGTCCGAACACGGAGTCTATTTGTTGGAGAACGGAGTCCACATGTTGATATGGGTGGGGTCCCAAGCGCCGCTGGAGTTCGTGAGGGATGTGTTCGGAGCGAACTCGCCGCAGGCCGTAGACGCCCGGGTGTGCGAGCTACCGGAAATAGACTCGCGAGTCGGCGCAGCGGTGCGCAGGCTCGTGGACGACACCAGGCATAAGAGGAGGAACGC
CATGAGGCTAACCGTATTACGGCAGCACGACAAGCTGGAGACGGTGCTGCGTCAGCTTTTAGTGGAGGATCGGGGGGGTAGACGGCGGGGCTTCCTACGTCGACTACCTCTGCCATATACACAAGGAGATACGCGCCCTACTCTAGCGGCCCACGCTTTGGGAACACCTGTGGAAGTCCTTAAGATATCTGTCTGA

Protein sequence:

>DPOGS203768-PA
MAETIGTEQEPPLLDFAALTGSPTMGPVRCCRCKAYMCPNMKFIDGGRHFKCAFCKATSEVPMEYTQYITSMQQYGRVPPEMALGTYEIVATKEYCRNNTLPNPPAIVFVIDVSYNSIKSGLLQTICDNILEIIQTPLTDGEGKHREETWSYNYKSPIRLEQALLCKRNKQANKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQTNKQKKSPQSTSYNELGQLCSAAGVCVQMFVCNNSYVDCATIGQLARLTGGQVHKYTYFTSDTDGHRLMWDVSRVLSRPTAHDAVMRVRTSTGVRPTDFYGHFFMSNTTDVELAAIDSDKAIGVEIKHDDKLTAESGVYIQAALLYTHRSGQRRLRVINLALSLAHQLADVYRSAELDTIVNFLTKQAVWALREATPRQVREGLTSRCARSLAAYRRHCASPSSAGQLVLPEAMKLLPLYTSCVLRSDAVGGGPDITCDDRSCAMYRALTADVSLSLVYTYPRLLPLHVLPDQEPAPLRASIDKMSEHGVYLLENGVHMLIWVGSQAPLEFVRDVFGANSPQAVDARVCELPEIDSRVGAAVRRLVDDTRHKRRNAMRLTVLRQHDKLETVLRQLLVEDRGGRRRGFLRRLPLPYTQGDTRPTLAAHALGTPVEVLKISV-
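
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, which assumes the transcript is a complete coding sequence read in frame from its first base, translated with the standard genetic code, and that the trailing "-" in the protein marks the stop codon. The names translate and expected_cds_length are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    # Drop whitespace so a sequence wrapped over several lines still works.
    cds = "".join(cds.split()).upper()
    residues = []
    # Ignore any incomplete codon at the 3' end.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length: int) -> int:
    """Length in bp of a CDS encoding protein_length residues plus one stop codon."""
    return 3 * (protein_length + 1)


# First 27 bases and last 15 bases of DPOGS203768-TA, copied from the record.
print(translate("ATGGCAGAGACTATCGGCACGGAACAG"))  # MAETIGTEQ
print(translate("AAGATATCTGTCTGA"))              # KISV-
print(expected_cds_length(731))                  # 2196
```

Under these assumptions the stated lengths agree: 731 residues plus one stop codon correspond to 2196 bp, and the translated ends of the transcript match the start (MAETIGTEQ) and end (KISV-) of DPOGS203768-PA.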