Monarch geneset OGS2.0

DPOGS214206
TranscriptDPOGS214206-TA1158 bp
ProteinDPOGS214206-PA385 aa
Genomic positionDPSCF300014 + 238992-243644
RNAseq coverage326x (Rank: top 35%)
Annotation
HeliconiusHMEL0068120.088.31% 
BombyxBGIBMGA006215-TA9e-9085.71% 
DrosophilaCG7011-PA1e-11254.21% 
EBI UniRef50UniRef50_C1BMS55e-11553.14%Endoplasmic reticulum-Golgi intermediate compartment protein 3 n=1 Tax=Caligus rogercresseyi RepID=C1BMS5_9MAXI
NCBI RefSeqXP_974331.22e-14662.18%PREDICTED: similar to AGAP012144-PA [Tribolium castaneum]
NCBI nr blastpgi|1892378213e-14562.18%PREDICTED: similar to AGAP012144-PA [Tribolium castaneum]
NCBI nr blastxgi|1892378211e-13962.18%PREDICTED: similar to AGAP012144-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[146-365] IPR0129361.8e-74Domain of unknown function DUF1692
Orthology groupMCL13624 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214206-TA
ATGGCTTCTCAAATAATTGGTAAATTTAAACAATTGGATGCGTATGCTAAAACATTAGAAGATTTTAGAGTTAAAACAGCGACTGGTGCAATAATTACAGTCACGGGTGCGTTTGTAATGATTTTACTGATAGTTTTGGAGTTGCACACGTATATGTCTCCTAATATATCAGAAGAGCTATTTGTGGACACCTCAAGAGGTCATAAATTAAGGATTAATTTCGATATTGTTGTTCCCAGGATATCCTGTGATTATTTGGTATTGGATGCTATGGACTCATCAGGTGAACAGCATCTGCAGATGGACCACAATGTACACAAAAGAAGATTAGATTTGGATGGTGTTCCTATAAAAGAACCAATAAAGGAGGACATATCCCTTTCATCAACAGTTAAACAAAATAGTTCAGAAATAGCCATAGTTACATGTGGGAGTTGTTATGGAGCAGCATTTAATGATTCACAATGTTGTAACACTTGCGAAGATGTTAAAGAAGCATATAGATTAAGACGATGGGCTCTGCCAGATCTAGCAACTGTAGAACAATGTAAGGATGATGATTCATTAGAAAGAACTAATCTAGCTCTCAAAGAAGGATGTCAGATCTATGGTTATATGGAAGTAAATAGGGTAGGAGGAAGTTTCCACATAGCACCAGGTAAAAGTTTCACAATCAATCATGTGCATGTACATGATGTCCAGCCATTCTCTTCATCAGTTTTTAACACTACCCATATTATAAGGCACCTATCATTTGGTTCAGACATTGAAAGTGCAAATACAGCTCCCTTGGATGGAATAACAGGTTTAGCCAAAGAAGGTGCTGTTATGTTTCAGTATTATTTAAAAATAGTTCCAACAATGTATGTAAAACTGGATGGTACCATTTTACATACTAATCAGTTTTCAGTGACAAGACATCAGAAGTCAGTATCCAATATAAATGTTGAATCCGGAATGCCGGGTGCATTCTTTAGTTATGAATTGTCACCACTCATGGTTAAATATACAGCAAAAGGAAGGTCTATAGGCCACTTTGCCACAAATGTCTGTGCAATTGTAGGTGGAGTGTTCACTGTTGCTGGAATTTTTGATACCCTACTGTATCATTCACTTAATGCATTTCAAAACAAAGTTGTCCTTGGTAAAGCCGGTTAA

Protein sequence:

>DPOGS214206-PA
MASQIIGKFKQLDAYAKTLEDFRVKTATGAIITVTGAFVMILLIVLELHTYMSPNISEELFVDTSRGHKLRINFDIVVPRISCDYLVLDAMDSSGEQHLQMDHNVHKRRLDLDGVPIKEPIKEDISLSSTVKQNSSEIAIVTCGSCYGAAFNDSQCCNTCEDVKEAYRLRRWALPDLATVEQCKDDDSLERTNLALKEGCQIYGYMEVNRVGGSFHIAPGKSFTINHVHVHDVQPFSSSVFNTTHIIRHLSFGSDIESANTAPLDGITGLAKEGAVMFQYYLKIVPTMYVKLDGTILHTNQFSVTRHQKSVSNINVESGMPGAFFSYELSPLMVKYTAKGRSIGHFATNVCAIVGGVFTVAGIFDTLLYHSLNAFQNKVVLGKAG-