Monarch geneset OGS2.0

DPOGS214955
TranscriptDPOGS214955-TA2136 bp
ProteinDPOGS214955-PA711 aa
Genomic positionDPSCF300280 + 179039-187735
RNAseq coverage1011x (Rank: top 13%)
Annotation
HeliconiusHMEL0155960.086.02% 
BombyxBGIBMGA004852-TA0.084.66% 
Drosophilasec23-PD0.075.64% 
EBI UniRef50UniRef50_F5H3650.066.84%Sec23 homolog A (S. cerevisiae) n=11 Tax=Euarchontoglires RepID=F5H365_HUMAN
NCBI RefSeqXP_002013896.10.078.60%GL24389 [Drosophila persimilis]
NCBI nr blastpgi|1951458360.078.60%GL24389 [Drosophila persimilis]
NCBI nr blastxgi|1951458360.078.54%GL24389 [Drosophila persimilis]
Group
Gene OntologyGO:00068867.3e-75intracellular protein transport
GO:00301277.3e-75COPII vesicle coat
GO:00068887.3e-75ER to Golgi vesicle-mediated transport
GO:00082701.6e-23zinc ion binding
KEGG pathwaydpe:Dper_GL243890.0 
 K14006 (SEC23)maps-> Protein processing in endoplasmic reticulum
InterPro domain[127-391] IPR0068967.3e-75Sec23/Sec24, trunk domain
[464-566] IPR0069004e-52Sec23/Sec24, helical domain
[46-129] IPR0068951.6e-23Zinc finger, Sec23/Sec24-type
[578-664] IPR0071235.7e-12Gelsolin domain
[406-449] IPR0129904.4e-09Sec23/Sec24 beta-sandwich
Orthology groupMCL11317 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214955-TA
ATGGCAACTTATGAAGAGTTCATTCAACAAAATGAAGATAGAGATGGCATCAGGTTTACGTGGAACGTATGGGCTTCAAGTAGAATTGAAGCTACAAGGCTTGTTGTCCCTTTAGTTTGTCTTTACCAGCCATTGAAGGAACGACCGGACCTACCTCCTATTCAATATGAACCAGTATTGTGCACACGTAATACTTGTCGCGCAGTACTCAATCCAATGTGCCAAGTGGACTACAGAGCCAAATTGTGGGTCTGCAATTTTTGTTTCCAAAGAAATCCTTTTCCCCCGCAATATGCTGCTATATCAGAACAGCATCAACCGGCGGAGCTCATCCCAAACTTTTCCACCATTGAGTACACCATAACCAGAGCACAAAGCATGCCACCCATTTTTCTGCTTGTGGTTGATACTTGCCTTGATGAAGAAGAACTCGGAGCTCTCAAAGACTCCCTACAAACTTCACTTAGTCTTATGCCTCAAAATGCTCTTGTTGGACTTATTACATTTGGCCGTATGGTACAAATTCATGAATTGGGAACAGAAGGCATATATAAGTGCTATGTTTTCAAGGGCACAAAAGATTTGACAGGTAAACAAATTCAAGAACAATTAGCAATAGGCCGAGTCAATGCACCTAACCCTCAACAGAGACCCGGTACTGCTCCACCTCAGCCACCGGCACATCGGTTCTTGCAACCAGTGAAACAGTGTGACATGGCTCTCACGGATTTATTGAGTGAACTAGGTCGTGACCCTTGGCCATTAGGTGTCGGGAAACGTCCTCTAAGAAGCAGCGGTGTCGCTCTGTCATTAGCTGTGGGAATGTTGGAGGTTACATATCCCAATACTGGTGCTAGAATCATGATGTTCCTTGGAGGACCATGCTCTCAAGGCCCCGGTCAAGTTGTCAATGATGAATTGAAGCAACCAATTCGTTCCCATCATGACATCCATAAGGACAATGCAAAATATATGAAGAAGGCTATCAAACATTATGAAGCTTTGTCTCTTAGGGCTGCTACTAATGGCCATGCCATTGATATTTACTCATGTGCTCTAGACCAGACCGGGTTGATGGAGATGAAACAGTGTTGTAATTCTACAGGTGGACACATGGTAATGGGTGATTCATTCAATTCGTCATTGTTCAAGCAAACATTCCAAAGAGTCTTTGCTAAGGATCAGAAAGGAGATTACAAAATGGCATTCAACGGAACATTGGAAGTGGTAAACCAGCACACAGCGCCCTTACCAGCAGGTGGTCGCGGCTGTGTCCAGCTCATCACGCAGTACCAGCACTCCAGCGGGCAGAGACGCGTCAGGGTCACTACCATTGCTAGAAACTGGGGCGATGCTGCCGTCAACCTTCATCATATATCAGCCGGCTTCGACCAGGAAGCCGCCGCGGTAGTGATGGCCCGCCTCGTGGTGTACCGCGCCGAACAAGAGGACGGACCGGATGTACTGCGCTGGTTGGACCGGATGCTGATACGACTGTGTCAGAAGTTCGGAGAGTATGCTAAGGATGATCCTAACAGTTTCCGTCTGTCAGAGAACTTCAGCCTCTACCCTCAGTTCATGTACCACCTCCGGAGATCACAGTTCCTTCAAGTATTCAATAATTCACCCGATGAAACTACTTTCTACAGACACATGTTAATGCGTGAGGACCTCACCCAGTCCCTCATCATGATCCAGCCCATACTGTACTCCTACAGTTTCGGAGGTCCCCCAGAGCCTGTTCTGCTGGACACTTCCTCCATACAGCCTGACAGGATACTGCTTATGGATACTTTCTTCCAGATCTTGATATACCATGGAGAGACTATAGCCCAATGGAGGGCGCTGCGCTACCAGGACATGGCAGAGTACGAGAGTTTCGCCCAGCTGTTGAGGGCTCCTGTAGACGACGCACAGGATATCCTGCAGAACAGATTCCCCGTCCCTCGGTATATTGACACAGAGCACGGAGGTTCACAGGCGCGATTCCTTTTATCGAAAGTGAATCCCTCAAGGACACACAACAACATGTACGCCTACGGTGGGGACGGTGGCGCCCCCGTGTTGACGGACGATGTTTCACTGCAAGTGTTCATGGAGCACCTCAAAAAGCTGGCTGTCTCATCAACGGCGTAA

Protein sequence:

>DPOGS214955-PA
MATYEEFIQQNEDRDGIRFTWNVWASSRIEATRLVVPLVCLYQPLKERPDLPPIQYEPVLCTRNTCRAVLNPMCQVDYRAKLWVCNFCFQRNPFPPQYAAISEQHQPAELIPNFSTIEYTITRAQSMPPIFLLVVDTCLDEEELGALKDSLQTSLSLMPQNALVGLITFGRMVQIHELGTEGIYKCYVFKGTKDLTGKQIQEQLAIGRVNAPNPQQRPGTAPPQPPAHRFLQPVKQCDMALTDLLSELGRDPWPLGVGKRPLRSSGVALSLAVGMLEVTYPNTGARIMMFLGGPCSQGPGQVVNDELKQPIRSHHDIHKDNAKYMKKAIKHYEALSLRAATNGHAIDIYSCALDQTGLMEMKQCCNSTGGHMVMGDSFNSSLFKQTFQRVFAKDQKGDYKMAFNGTLEVVNQHTAPLPAGGRGCVQLITQYQHSSGQRRVRVTTIARNWGDAAVNLHHISAGFDQEAAAVVMARLVVYRAEQEDGPDVLRWLDRMLIRLCQKFGEYAKDDPNSFRLSENFSLYPQFMYHLRRSQFLQVFNNSPDETTFYRHMLMREDLTQSLIMIQPILYSYSFGGPPEPVLLDTSSIQPDRILLMDTFFQILIYHGETIAQWRALRYQDMAEYESFAQLLRAPVDDAQDILQNRFPVPRYIDTEHGGSQARFLLSKVNPSRTHNNMYAYGGDGGAPVLTDDVSLQVFMEHLKKLAVSSTA-