Monarch geneset OGS2.0

DPOGS202338
TranscriptDPOGS202338-TA1152 bp
ProteinDPOGS202338-PA383 aa
Genomic positionDPSCF300032 + 864941-867054
RNAseq coverage165x (Rank: top 51%)
Annotation
HeliconiusHMEL0205320.084.94% 
BombyxBGIBMGA004835-TA0.082.08% 
DrosophilaCG1116-PA4e-8539.42% 
EBI UniRef50UniRef50_F4WYW71e-10150.13%Retrograde Golgi transport protein RGP1-like protein n=13 Tax=Neoptera RepID=F4WYW7_ACREC
NCBI RefSeqXP_974794.16e-11353.94%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|910870611e-11153.94%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|910870612e-10853.96%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[188-329] IPR0148485e-35Rgp1
Orthology groupMCL13494 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202338-TA
ATGATTGAGCTCTCAGCCAAACTTACCACTGGCAGTGTTTATTTAGCTGGAGAGGCTTTGGAATGTCGTATATCTTTTTGTCATACAACTCAACCTGAACATAGAAAATCACAAAGCCATAGTGATATCCTGGAAAATTTGGCATGGGCTTCCGCTCAAATACATTGTTTCTTTTCAACATCTAAAAATACCACAGACAAAAATGTTATTGCAGAAAAAACAACAGCTCTCGAAGTAACCTCATGTGATATAGGAGATGTGGTATTTCATACTAAACCTAAGATATTGTTTTGTGACTTAACTATACCACTCGGGGAAACAAAGACTTTTTTGTATCGAGAAACTCTGCCATTAGAAGCACCTCCATCATACAGAGGCACGGCAGTTAAATACTCCTACAAGATCACAATAGCCACACAGAAAGTTGGGTCTCATATCAAAATGGTCCGTTTACCGTTTAGAGTACTACCTATCAGTCCTGTTATGAACTTGCAAGACCTTCCTGCTTTGTGTGAAACAACAGACATACTACAACCAACCAGTCCATTTTCAGAAGGTAGGAAAGTTGAAACTCATCTGACTATGGCCTTACAAGTTTTGCAGAATTTAACAGCCCGAAGAAGCCCAAACTCATACATGATAACCAATGGCAGGGGGAAAGTAGGTCGTTTCTGCCTATTCAAATCTGCATACAGATTAGGTGAAGACATTGTTGGTACATTTGATTTCTCTGTAGGAACTATAACATGCATGCAGGTGTCTGTATCACTACAACCTGAAGAAGTACTAAAATCTAAGACGCCTTCAAAGAATGTAAACAAAGAAAGCTGCAGTAGATCAATGACAGTCGCAAGGTATCATGAGGTGACTTTGGGACTTACACACTCGCAGCTTATCTTACCCATACCACTACATATAACTCCGGCCTTCGAAATTGATGAGGTATCTCTGAATTGGCGCCTACATTTTGAATTTGTGTTAAGTCAAGATAAGCTATTACCCAATCCGGAGGACAAAGATTGGAATGCACCACTCAATGTGCCAATAGAAACCATGGTCTGGAATCTTCCTGTAAAAATTTACACGACACTTCCCAAACAAATAGCCCAACAATTCCATGGGAACGACAACTATACTATGTTTATAAAATAA

Protein sequence:

>DPOGS202338-PA
MIELSAKLTTGSVYLAGEALECRISFCHTTQPEHRKSQSHSDILENLAWASAQIHCFFSTSKNTTDKNVIAEKTTALEVTSCDIGDVVFHTKPKILFCDLTIPLGETKTFLYRETLPLEAPPSYRGTAVKYSYKITIATQKVGSHIKMVRLPFRVLPISPVMNLQDLPALCETTDILQPTSPFSEGRKVETHLTMALQVLQNLTARRSPNSYMITNGRGKVGRFCLFKSAYRLGEDIVGTFDFSVGTITCMQVSVSLQPEEVLKSKTPSKNVNKESCSRSMTVARYHEVTLGLTHSQLILPIPLHITPAFEIDEVSLNWRLHFEFVLSQDKLLPNPEDKDWNAPLNVPIETMVWNLPVKIYTTLPKQIAQQFHGNDNYTMFIK-