Monarch geneset OGS2.0

DPOGS207429
TranscriptDPOGS207429-TA1146 bp
ProteinDPOGS207429-PA381 aa
Genomic positionDPSCF300087 + 448618-449763
RNAseq coverage159x (Rank: top 52%)
Annotation
HeliconiusHMEL0054440.087.47% 
BombyxBGIBMGA009329-TA0.078.44% 
DrosophilaCG4293-PB9e-7639.48% 
EBI UniRef50UniRef50_F4WZ145e-12058.68%Endoplasmic reticulum-Golgi intermediate compartment protein 2 n=8 Tax=Coelomata RepID=F4WZ14_ACREC
NCBI RefSeqXP_966630.24e-11853.44%PREDICTED: similar to AGAP005044-PA [Tribolium castaneum]
NCBI nr blastpgi|3071880572e-12059.28%Endoplasmic reticulum-Golgi intermediate compartment protein 2 [Camponotus floridanus]
NCBI nr blastxgi|3071880576e-11760.06%Endoplasmic reticulum-Golgi intermediate compartment protein 2 [Camponotus floridanus]
Group
KEGG pathway 
InterPro domain[167-334] IPR0129362e-43Domain of unknown function DUF1692
Orthology groupMCL15116 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207429-TA
ATGATAAGATATCGTGGCAAAAAGAAGGTAATAGACAAGGTGAAAGAATTAGATGCTTTTTCAAAGGTTCCTGATGAATACGTGGACAGTACGCCGGTAGGAGGAACATTTTCAATTATAACGTTTTTTATTATCATGTGGCTCGTTTATAGTGAAGTTTCATACTTTCTAGACAGCAATCTGGCTTTCCGGTTTATGCCAGACACAGATATGGATGAAAAGTTAAGAATAAATATAGATATCACCATTGCAATGCCTTGCTCCAATATTGGGGCTGATATTTTGGACTCTACTTCACAAAGTGTGTTTGGCTTCGGTGAATTGCAGGAAGAAGATACCTGGTGGGAACTTACACCAGAGCAAAAGAATGCCTTTGAAGCAGTGAAATATATGAACTCTTATTTGAGAGAGGAATATCATTCTGTGTGGCAATTACTATGGAAAAAAGGTCATGGGTCTGTCAGAGCTACTGTTCCCCCTCGGAAAACTAAACCTAATCGGCGACCAGACGCCTGTAGACTACATGGAGTGCTCACATTGAATAAGGTTGCTGGAAACTTCCATATAACAGCCGGTAAAAGTTTACATTTACCAAGAGGGCACATACACTTAAATATGCTCTTTGATGACACCCCACAAAATTTTAGCCACAGAATAAATAGACTGAGTTTTGGCAGTCCTGCCAATGGTATTATATATCCATTGGAAGGAGATGAGAAAATTACTTCAGATGAGAGCATGTTATATCAGTATTTTCTAGAAGTTGTGCCAACTGATGTCGACACTACATTCGAATCTATCAAGACCTTCCAGTACTCCGTCAAAGAACTGGCACGGCCTATCAGCCATAGCAAAGGCTCACATGGCGTGCCGGGAGTGTTCTTCAAATATGATATGGCAGCATTGAAGGTACAAGTCTACCAAGAAAGGGAAAATTTACTGCAATTTATGTTGCGTCTATTCTCTATAATTGGTGGCATTTATGTCATAATTAGTTTCATTAACACTATAGTTCTTACCGCTAAGACATTGCTAGTTAAGAAGCCAGAAGTTAAGAAGAATGAAGATTCTTCACCCAAGTATATGAAGAAGAATCTCTTGTTGACAACACCTGATCTCATCCCGTTGGACTTATCAAATCAATGA

Protein sequence:

>DPOGS207429-PA
MIRYRGKKKVIDKVKELDAFSKVPDEYVDSTPVGGTFSIITFFIIMWLVYSEVSYFLDSNLAFRFMPDTDMDEKLRINIDITIAMPCSNIGADILDSTSQSVFGFGELQEEDTWWELTPEQKNAFEAVKYMNSYLREEYHSVWQLLWKKGHGSVRATVPPRKTKPNRRPDACRLHGVLTLNKVAGNFHITAGKSLHLPRGHIHLNMLFDDTPQNFSHRINRLSFGSPANGIIYPLEGDEKITSDESMLYQYFLEVVPTDVDTTFESIKTFQYSVKELARPISHSKGSHGVPGVFFKYDMAALKVQVYQERENLLQFMLRLFSIIGGIYVIISFINTIVLTAKTLLVKKPEVKKNEDSSPKYMKKNLLLTTPDLIPLDLSNQ-