Monarch geneset OGS2.0

DPOGS210130
TranscriptDPOGS210130-TA2997 bp
ProteinDPOGS210130-PA998 aa
Genomic positionDPSCF300017 + 1729584-1745957
RNAseq coverage435x (Rank: top 28%)
Annotation
HeliconiusHMEL0071970.059.01% 
BombyxBGIBMGA000237-TA0.057.22% 
DrosophilaCG3764-PA1e-4730.93% 
EBI UniRef50UniRef50_D6WL071e-5334.54%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WL07_TRICA
NCBI RefSeqXP_970594.22e-5434.54%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|1892373654e-5334.54%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|2700070818e-7832.10%hypothetical protein TcasGA2_TC013532 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL18245 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210130-TA
ATGGCATTAATAGAAAACATCTTTAGAAGAAGAAAAATTGATTATGCTTATTGGGACTCACGAATCAGGGGCAGAGCTGGTGAATTAGCTTTGGAGCAAGTGAGATTGCTTCTGTATAAGGAATGTGATCGTAGAGGAAGAAAAGTGCTTTTTGATTCATCTACTATTGATAAAGTATTGTCTGCAAAAAATGACAGACCAGATATCAAAATGGAAAAAATTCCCTGCATCGTTGAAGTTACAGATGGAAACTCATACATTTACAAAGGTCAGGCGACGGATGCGAGTGTCCTCGGTGAGATGATATTCGGAGCGGTGGCAATGAATTGCAAGAGCGTTTCCTTTAAGATACATATCATGAATGAACCGAAAAGGTTAATGTGCACCAAGTTGTTCAGTGTGCCTGTATCAAGGAAGACCAGTCGGATGGAAAGGAAGGCAGATAGTTGTAGCGGAGATGTCAATAGATCGAGGCCTTTGAATATGACGTTACTGAGAGAAGAGGGATTGGCGTTGAGCTTCTCATTGGATAGAGGCGATTCTGGTTTCTGTGAGACATCTTCGTACAGCAGTTTCGGTACAAGCTTCGATTACCTCACCATGTTTCACGACTGGGATCAGAACGGCGACGAACACTACTTTTATAGTCCAAGTTCAAAGTTAAGCGCTTCAAGCGGTAGTTTAGCTAGACGTACGTCCCATAGCTACGCTACTCGCTTCGATTTCGGGAATACTTTGAAAGTACCCACCAGTAGTACAGTGTGTAGCGCGGATTCGCAGTTATATAGCAGTTCAACGAGTACGTCAGACAGTTTGGCGAGCACTGCGTCTACACGACGCGCTAAACTCGGACTAGCGTTGCTAATTACCTTTACTGAATCTGATGATATGGAGTTGATCCGTCGCTGTTTAGAGTTATCACCACAGCTGCGGTCGCTCGTGTGTCGTCTACGTCTCGCAGCTCTCACCGCTGGCTCCGATGCATTCTTCGTGTCCACGCTACACACGGCCGCAGGACACGCGAGGAGATGGTTGTCGGAGTTATTGTTCGGACCGCGGCTTCATCCCACGTGGCTGCTTCTCGTGTCCAGTGAGCCTTCACAGGCAAACAAAATGGCCGACAGGCTGATAGAAGATATATGCTCAGTACTAGCTATCGGGGACACAAAGGATACTAACTTTTTCATCAGCACCCTTCTAACGCACGTGTTAACTCACCACCTCGGCTGGGTGACCACTGTAAGTCCCTACGACAGAGTGGAATCAAAAAATACAGGAAATTTGGACACTAAGAGGCCATACAACGCGCTCTGGGCACAACTGACGGATCTCTGTGGATGTATAGGATTTCCACCGAAATCCGCTAGGACCATCATAACCGGCAACAAGAATATACTGTTCATTAATAGATTACTGGACGTTCTTACTTATTTTATACGTTGCGGTGACGTTAAGAAGAACGATTTCGTGTATCGAGACTGTTCAGTGAGCGAGGTCAAAGTCATCAATGTGAAAGCTGCTAGTGAAAACGATAAGTGTGATTTGAATGACTATAGTACAAAATATAGCTTGAAAGTTCCATCATACAGCGGTAGCAGTGCGAGTACTTTAGTGTCCAGTGAGGTGTCTTTGAAGAAATCGGCAACGTTCGTTGATTTAAATAATGTGCTCTCTAATTTTGACTTTGCATCTGATAATGGGAGCAAATTGAGACGGCATCCGACCATGATGATCTCATTGAAGGATTCTGACTCCAGCTCAAACGCGTCATCAGAAGAATGTGAGAAGAATGTTGTGTTCGTGCTCGGTGATGATGAGAAACTTGTGGGGTTGAAGAATAAATCAAACGGTAAGAGGAATCTAAAGAAAACTTCAAGAGCTTCGGAGACAGAGGAGAAAGAGAAAGAGGAACGTGACGACGTGAGCCAAGAAAAATACAAGTCATCACAAAGCCCGAAGTGCTGTGACCAAACACTCAAACATTCCAAGCCCATAAAACATTCCGGTTTTAAATTCGAGTTCGATAAATATCCGCAGATAGTAACTAATTATATGAAGAGCAAGAACTTAGAGATTCTAGATAGACATTACATAGGGAAGCCGGGGAACCTGAAACTGGACAATTTCCAGTTCGATCCAACATTCGTACCTCCGATACAGGAAGACAGATGCGAGACCTGTTACAAGTGCCAGCTGATGGAGTCCATGTTGCAAACTCCCACCAACGCCTCCGAAATGGAATATATGAACGATATACCGAGACAATCGGAACCGCAGATCGCTAAGGAGACGATAGTCCAAGAGGAAATGACGCCCAAGACATTTGTTAGGAAGCGCAAAGAGAGTACCGTCGTCGTGAATGTCAGGAAGCCTGTGACCGAAGTGAAGGTGAAGGTGGACGAAGAAAAAGATAACAATGTAATAGAAGTGAAGCAAGTTTTAGAATTCCCTGTTCCCCAAGTGTGTCCCATCGCTAAAACTGACTGCAACGATACACTTCTAGGCGGCATTACTGACCACTATGTACCTGATCTTATATTACAAGGAACAATCGCTAATCCGGACACATGGGAATCGGAACTTCGGAGGGATTTAGATCTGACGTCGTATTTGAACAAGAGCTCTGATTCGCCTATACAAACTGTAGCCATAGTCGGTGACACTAATACTTGGCAGGTGAGAGTTTGTGGGCGATCAGTTGGTCCTATGGGAGGGGGATTGTCGCCACTCGTTGGTGGTATTTTGGACGCTTTACCCGCCATGAGAAAAGCCAACGTACCCGCCGCTCAGTGCCTACTATTTTTGGAGAGCAAACTCCGCGAATTCTGTGTACTATCAAAAACTTTAGCCGACATATTGATGTCCACCGACTTTTGTGATATAGCAACACTAACAAAATCTTTGAATGTGGATGTTAACGACGTCCCTCTATTATTGGCGGTCGCTACAACCCACACGCCGGAACTAGCTACTAGATATGGTATTAGCTATAGATGA

Protein sequence:

>DPOGS210130-PA
MALIENIFRRRKIDYAYWDSRIRGRAGELALEQVRLLLYKECDRRGRKVLFDSSTIDKVLSAKNDRPDIKMEKIPCIVEVTDGNSYIYKGQATDASVLGEMIFGAVAMNCKSVSFKIHIMNEPKRLMCTKLFSVPVSRKTSRMERKADSCSGDVNRSRPLNMTLLREEGLALSFSLDRGDSGFCETSSYSSFGTSFDYLTMFHDWDQNGDEHYFYSPSSKLSASSGSLARRTSHSYATRFDFGNTLKVPTSSTVCSADSQLYSSSTSTSDSLASTASTRRAKLGLALLITFTESDDMELIRRCLELSPQLRSLVCRLRLAALTAGSDAFFVSTLHTAAGHARRWLSELLFGPRLHPTWLLLVSSEPSQANKMADRLIEDICSVLAIGDTKDTNFFISTLLTHVLTHHLGWVTTVSPYDRVESKNTGNLDTKRPYNALWAQLTDLCGCIGFPPKSARTIITGNKNILFINRLLDVLTYFIRCGDVKKNDFVYRDCSVSEVKVINVKAASENDKCDLNDYSTKYSLKVPSYSGSSASTLVSSEVSLKKSATFVDLNNVLSNFDFASDNGSKLRRHPTMMISLKDSDSSSNASSEECEKNVVFVLGDDEKLVGLKNKSNGKRNLKKTSRASETEEKEKEERDDVSQEKYKSSQSPKCCDQTLKHSKPIKHSGFKFEFDKYPQIVTNYMKSKNLEILDRHYIGKPGNLKLDNFQFDPTFVPPIQEDRCETCYKCQLMESMLQTPTNASEMEYMNDIPRQSEPQIAKETIVQEEMTPKTFVRKRKESTVVVNVRKPVTEVKVKVDEEKDNNVIEVKQVLEFPVPQVCPIAKTDCNDTLLGGITDHYVPDLILQGTIANPDTWESELRRDLDLTSYLNKSSDSPIQTVAIVGDTNTWQVRVCGRSVGPMGGGLSPLVGGILDALPAMRKANVPAAQCLLFLESKLREFCVLSKTLADILMSTDFCDIATLTKSLNVDVNDVPLLLAVATTHTPELATRYGISYR-