Monarch geneset OGS2.0

DPOGS200385
Transcript    DPOGS200385-TA    1587 bp
Protein    DPOGS200385-PA    528 aa
Genomic position    DPSCF300852 - 1321-6526
RNAseq coverage    267x (Rank: top 40%)
Annotation
Heliconius    HMEL004231    0.0    82.77%
Bombyx    BGIBMGA013343-TA    0.0    67.73%
Drosophila    gho-PA    6e-176    54.43%
EBI UniRef50    UniRef50_Q9VQ94    9e-174    54.43%    CG10882 n=30 Tax=Coelomata RepID=Q9VQ94_DROME
NCBI RefSeq    XP_971886.1    0.0    58.54%    PREDICTED: similar to Sec24B protein, putative [Tribolium castaneum]
NCBI nr blastp    gi|91094647    0.0    58.54%    PREDICTED: similar to Sec24B protein, putative [Tribolium castaneum]
NCBI nr blastx    gi|91094647    0.0    58.54%    PREDICTED: similar to Sec24B protein, putative [Tribolium castaneum]
Group
Gene Ontology    GO:0006886    9.4e-48    intracellular protein transport
    GO:0030127    9.4e-48    COPII vesicle coat
    GO:0006888    9.4e-48    ER to Golgi vesicle-mediated transport
KEGG pathway    tca:660571    0.0
    K14007 (SEC24)    maps-> Protein processing in endoplasmic reticulum
InterPro domain    [3-180]    IPR006896    9.4e-48    Sec23/Sec24, trunk domain
    [281-386]    IPR006900    4e-29    Sec23/Sec24, helical domain
    [187-269]    IPR012990    3.3e-20    Sec23/Sec24 beta-sandwich
Orthology group    MCL10886    Multiple-copy universal gene

Nucleotide sequence:

>DPOGS200385-TA
GGTACCCTGGCGCAGCCCGCAATGCTCTCAGTGGGCGACACCGCGGACATGTTCGTGCCACTGCTAGAAGGATTCCTGTCCACCGCGGAGGACAGCGGGCCGGTGTTGACGTCACTGCTGCAACAGCTGCCGCAGGTGTTCCAGAACAACAAGGACACGGAGACCGTGCTGCTGCCGGCCGTCCAGGCTGGCTACGAGGCTCTGAAGGCCGCTGACACTTCCGGCCAACTGCTAGTCTTCCACACATCGCTGCCCACATACAACGCGCCCGGGAAACTCATCAACAGGGAAGACAGGAAGTTACTCGGCACGGATAAGGAAAAACAGATACTTTCCCCTCAGAGCACGTCATACAATGAGCTGGGGCAGCTGTGCAGCGCTGCAGGCGTGTGTGTCCAGATGTTCGTGTGTAACAACTCGTACGTCGACTGCGCCACCATCGGCCAGCTGGCCAGGCTCACCGGCGGACAAGTACACAAGTACACGTACTTCACATCGGACACGGACGGCCACCGTCTGATGTGGGACGTGTCGCGCGTGCTGTCCCGCCCGACCGCTCACGACGCGGTCATGCGCGTGAGGACCAGCACCGGCGTGCGCCCCACAGACTTCTACGGACACTTCTTCATGTCCAACACCACGGACGTCGAGCTGGCCGCCATCGACTCGGACAAGGCCATCGGCGTGGAGATCAAACACGACGACAAGTTGACGGCGGAGTCCGGCGTGTACATCCAGGCGGCGCTGCTGTACACCCACCGCTCGGGCCAGCGGAGGCTGAGGGTCATCAACCTCGCGCTGTCGCTCGCCCACCAGCTGGCGGATGTGTACAGATCCGCGGAACTGGACACCATCGTAAACTTCCTCACTAAACAAGCCGTGTGGGCGCTCCGTGAGGCTACGCCCCGTCAGGTCCGCGAGGGCCTCACAAGTCGCTGTGCTCGTTCGCTGGCCGCCTACCGACGTCACTGCGCCTCGCCCTCGTCCGCCGGACAGCTGGTGTTGCCGGAAGCCATGAAACTACTTCCACTATACACCAGCTGTGTGCTGCGGTCTGATGCTGTCGGCGGTGGGCCGGACATAACGTGCGACGACCGCTCGTGCGCCATGTACCGCGCGCTCACGGCGGACGTGTCCCTGTCGCTCGTGTACACCTACCCTCGCCTGCTGCCGCTGCACGTGCTGCCCGACCAGGAGCCCGCCCCGCTCAGGGCCTCCATAGACAAGATGTCCGAACACGGAGTCTATTTGTTGGAGAACGGAGTCCACATGTTGATATGGGTGGGGTCCCAAGCGCCGCTGGAGTTCGTGAGGGATGTGTTCGGAGCGAACTCGCCGCAGGCCGTAGACGCCCGGGTGTGCGAGCTACCGGAAATAGACTCGCGAGTCGGCGCAGCGGTGCGCAGGCTCGTGGACGACACCAGGCATAAGAGGAGGAACGCCATGAGGCTAACCGTATTACGGCAGCACGACAAGCTGGAGACGGTGCTGCGTCAGCTTTTAGTGGAGGATCGGGGGGTAGACGGCGGGGCTTCCTACGTCGACTACCTCTGCCATATACACAAGGAGATACGCGCCCTACTCTAG

Protein sequence:

>DPOGS200385-PA
GTLAQPAMLSVGDTADMFVPLLEGFLSTAEDSGPVLTSLLQQLPQVFQNNKDTETVLLPAVQAGYEALKAADTSGQLLVFHTSLPTYNAPGKLINREDRKLLGTDKEKQILSPQSTSYNELGQLCSAAGVCVQMFVCNNSYVDCATIGQLARLTGGQVHKYTYFTSDTDGHRLMWDVSRVLSRPTAHDAVMRVRTSTGVRPTDFYGHFFMSNTTDVELAAIDSDKAIGVEIKHDDKLTAESGVYIQAALLYTHRSGQRRLRVINLALSLAHQLADVYRSAELDTIVNFLTKQAVWALREATPRQVREGLTSRCARSLAAYRRHCASPSSAGQLVLPEAMKLLPLYTSCVLRSDAVGGGPDITCDDRSCAMYRALTADVSLSLVYTYPRLLPLHVLPDQEPAPLRASIDKMSEHGVYLLENGVHMLIWVGSQAPLEFVRDVFGANSPQAVDARVCELPEIDSRVGAAVRRLVDDTRHKRRNAMRLTVLRQHDKLETVLRQLLVEDRGVDGGASYVDYLCHIHKEIRALL-