Monarch geneset OGS2.0

DPOGS201800
TranscriptDPOGS201800-TA1797 bp
ProteinDPOGS201800-PA598 aa
Genomic positionDPSCF300145 + 25468-55814
RNAseq coverage551x (Rank: top 23%)
Annotation
HeliconiusHMEL0070501e-8085.34% 
BombyxBGIBMGA013228-TA2e-17883.50% 
DrosophilaCG32137-PB4e-7843.12% 
EBI UniRef50UniRef50_D2A6E02e-9045.68%Putative uncharacterized protein GLEAN_15685 n=2 Tax=Tribolium castaneum RepID=D2A6E0_TRICA
NCBI RefSeqXP_968757.22e-9045.58%PREDICTED: similar to AGAP001308-PA [Tribolium castaneum]
NCBI nr blastpgi|2700090526e-9045.68%hypothetical protein TcasGA2_TC015685 [Tribolium castaneum]
NCBI nr blastxgi|2700090522e-9645.66%hypothetical protein TcasGA2_TC015685 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL13136 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201800-TA
ATGGTGTGTGAAATTAAAACTCCATTAAAACATGCTGGCCCAGAGCCCCGTAAAGCACGACGCAGATGCTTCACAGTCCGCTGCCCCAAACCGATTCCCACAACTGTTTCCGTCACATGTCTGAACAACTACTATTCGGTTAATAATTTATTGTTTATTTATCTGTCGCTGGAAATTATATATCTCGAGTGGTTTCCACACGCCGAGGACGCGGTTCGTGATATTTCGTGGCCTGATAGAGACGTGATAACGTTCGAGTACTGGATTCACAAGGCGCAGTTTACTGTAAGGTTACTGGAAACAGAATGGGGTCGTTGTCATGAAAAAAGTTTAGTTCACACTGACGGATACAAAAGAATTGCGAGAAGATGTACGCTATTGTTTGTTTACAAGAATCAAGAAAATAAGCCACTTTACATCAACTCCATGGTTCACATATCCACGAGCATCCAGGTCACGTCGATGGTCCTTATCTCGGAGTTGGAGCAAGAAAAGCATCTCCTGCGACGACGTTTGGACACCGAGCAAGGGGAATACGAAGCCAGGCTTGTGGAACTACAGAATGACATCAAAGAACTCACCGCCAAGATCGACTCCAAGGACAATTCAGTTAAACAAAGAGAAGAAGAAAAGACGGGTCTTATAGCAGAGTTGACAGCACAGAATTCTCGTCTAACTAATCAACTGAAGGAGTCATCTGCTGTCGAGGCGCAGCTCTTAGCGCAACTGGAGTTGCTCAAAGATCAGTGCTCCATAAGAAAGACAAGCTTGCAAGACCATGTCCAAAGCTTGGAATCCCTCAAAGCTGAGCTGGCTCTCATGAGTGATAAAAGGGCGGATTTGGAAAGACGGCTGACCACATCGCTAAAAGACAAAGACAGCTTAACACAGCAATTAGATGAAGCTAACGACAGGATCTCAGCCCTGGAGAGGCAGTTGAAGGAACAGGAACATCTATACCAGAACACGCTCAAGGAGTTGGAGCGTCTACAGAGATCTCACGACACGCTGGCAGAAAGAGTTGGATCTGATCCGGTGGAAATTACGAACACTCCGAGGTCCTTGCACGCGGAACTGGAATCGGAACCGGAAGAAGATGAGAACTGGCTAAGAACAGAGGCTGTTCAGGTCTTCAAGCAGTTGAGGGCATTAGCCCTCCAACTGAACACGGGCCACGACGATGATTCCGGTCTACATTCAGATCTATCTTTGTCGTCTCTCGATGGTGATGAAGGGGAGACTCTCCGTCGTGGAGCACTGTCCGCCGCTTGTGCTGATGCCGTTGCAGCGTATGCAGCATTAGAGGGATCCAGAGTGAGGGACTCCATCGCCTCCCACGCGCGTCGTGCTATGGAGAGAGAGAGACAGATTGATGAAAAGAATGAGATCATAGCGGAACTGTCGTCCAAGCTGTCAGTGGCGGAAGTTGAACTGCGAGCGTCAGCTGACGAGAGAGATAAGCTGCTGAACGACGCGACATACAGTAGCTTACAGCATGATGAAGCTGTCACCAAAGCCAGGCAGGAGAGAGATGAAGCTATAGAGAGGAAAAAGGCCAGCGAGGTCGCTCTGGCTAAGACACGCGTAGAATTGATGCAGGCTAACAGCCAGCTGTACGAGGCGGTGAGACAGAAGATAGACCTGGGCCAACAGCTGGAGCAGTGGCAGATGGACATGCAGGAACTCATAGATGAACAGATGAAGCACAAACTGACGTCCCAGGAGAAACGCCGCAAACTCCCCCCGCCGCGCGCACCGACTCGCACCGAGAGACTATTCGGGCTTTTTCACCGGTAA

Protein sequence:

>DPOGS201800-PA
MVCEIKTPLKHAGPEPRKARRRCFTVRCPKPIPTTVSVTCLNNYYSVNNLLFIYLSLEIIYLEWFPHAEDAVRDISWPDRDVITFEYWIHKAQFTVRLLETEWGRCHEKSLVHTDGYKRIARRCTLLFVYKNQENKPLYINSMVHISTSIQVTSMVLISELEQEKHLLRRRLDTEQGEYEARLVELQNDIKELTAKIDSKDNSVKQREEEKTGLIAELTAQNSRLTNQLKESSAVEAQLLAQLELLKDQCSIRKTSLQDHVQSLESLKAELALMSDKRADLERRLTTSLKDKDSLTQQLDEANDRISALERQLKEQEHLYQNTLKELERLQRSHDTLAERVGSDPVEITNTPRSLHAELESEPEEDENWLRTEAVQVFKQLRALALQLNTGHDDDSGLHSDLSLSSLDGDEGETLRRGALSAACADAVAAYAALEGSRVRDSIASHARRAMERERQIDEKNEIIAELSSKLSVAEVELRASADERDKLLNDATYSSLQHDEAVTKARQERDEAIERKKASEVALAKTRVELMQANSQLYEAVRQKIDLGQQLEQWQMDMQELIDEQMKHKLTSQEKRRKLPPPRAPTRTERLFGLFHR-