Monarch geneset OGS2.0

DPOGS214789
Transcript: DPOGS214789-TA (5301 bp)
Protein: DPOGS214789-PA (1766 aa)
Genomic position: DPSCF300059 - 726083-742603
RNAseq coverage: 195x (Rank: top 48%)
Annotation
Heliconius: HMEL004970, 0.0, 76.58%
Bombyx: BGIBMGA012100-TA, 0.0, 66.56%
Drosophila: CG14215-PA, 5e-36, 24.20%
EBI UniRef50: UniRef50_Q16RF0, 3e-76, 31.61%, Putative uncharacterized protein n=2 Tax=Culicinae RepID=Q16RF0_AEDAE
NCBI RefSeq: XP_001661219.1, 6e-77, 31.61%, hypothetical protein AaeL_AAEL010978 [Aedes aegypti]
NCBI nr blastp: gi|157127902, 1e-75, 31.61%, hypothetical protein AaeL_AAEL010978 [Aedes aegypti]
NCBI nr blastx: gi|345495026, 7e-80, 23.11%, PREDICTED: hypothetical protein LOC100122107 [Nasonia vitripennis]
Group
KEGG pathway 
Orthology group: MCL17869 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214789-TA
ATGAGCACCGCGAGGTGTCCTGAGATCAGCCGCGGGTCATGCGTCACGTTAGACTTGAGGTTGGATTCAGAATCAGTCACGCCATACTCACACGCCACGCGACTCGAAGAACATTTCTTTCCTAATTCCTTGCAATACAACTGCGTGGCTCTCAGTACGTGGTCGACGGCGGTGGTTCAGACGGCGGGCGTCCAGCGTCAGCTGGTCCGCGGTATGAGCGCGGCGGGCCCGGCGTGTCTGCTCGCCCCGGCCAGATTGTACGCCGCGTGTAAAACCGCCGGTCTGACACCGCTCTACGCTCCCTCTGGGGACACACCGGAGGATCAACGTCGTTTTCTTCTGTCTGTGGCATTAGAGGCGAGGCTGTCGTCGTTCCTGAAACGATGTGCTCACGACTGGGCTACGGGGACCCACAGTGGTGTCGGATGTACATTGCCTTTCTTGGTGGATTGGTCGTGGAGTAGAGCTATTGAATTGAAAGAAAACGCTAAAGAACTGACCGCGCCGCTGTTTACTTCCTCTATGATGCCAGACAGGAATGTTATAAGATGTCTGGAGCATTGTGTGCAACAACTGAGTCAACTGACTGGTCTGTTAGACGCCATCCTCACTAAATGTTGTAATCTAGTTGTACCAGACGCGTTAAGCGAAATGGAAGAAAAATATAAAGGCATAGGCACAGTATCCTTATACTTCCAAGTTGTACAATGGTTTGTTAGGGTCGGACTGTTGCCTGAGAAGAGTTCGGACAGACATTCCCACACGTTACCCTACCCGGTGCATCAGCTATGTGGGATATATAACAAGCGTCGCATTAAGCTCAACCGTCTTCAAGACAAGTCGGACGACGAGTCGAGCAACGAGTCGTGTTCGCTTCTGTACATAGATCAACTCATTGAACACGAGTTCGGGGGAGATAGAGTCCATCAACTATGGATGGTATGTGGATCGAGCGGCGGTCTCTATCCCCCCCCTTCTCTGTTTTCCCTCTTGAGACTATACTTGCTGCCGGATGTTCCCGAGGAACACAAGCACTCTCTCCTACTCTACTTACTCCTGGATTATTCTATGATCTATGATGATATGCGCCACGAGTCCGTGATACGTCGGTTGATGCAGTTCCCAACTATGTTCGGTCTCAGTAACACGGCGATTAAAGCCACCCAAGCCTTCTGGCATCTCGACCACAGAGACTTTGATTTCGCTCTCGACCAACTTCAATGTCTAACTGGCAACACTCTCTCCGACTGGCAACATCACGTAGTCCTGTCTTCGTTACTGGCGCAGAAGAAAACTCAATCAGCGTTACAATACTTACACGTGAGAAAACCGGCTCCGATACACGTTAGTGACAACAATGATTATGACAAACTAGACGATTGGCAAACCTCCTGCAACCTGTACCTGGCCCGGGGCCTGGTGTTCGAAGCTTTGGATGTTATAAGGATGTGCGTAGAAAATGCCGGCTCCAGCGACGATAAGACGCAATTGTTGAACTATTTCTATAAAGGCTGTAGGAACAGCGGTCAACTGGCTAAGGTGCTGCAAGTAACGTTGTTGCCGTTTGAGGAGGAAGTGTTCATCAGATATCTCAAGGAGTGTAACGAATCCCACACATCAGACATCCTCGTTATGTACTACTTGCAACAGGCGAGGTACCTAGAAGCGGAACAGTATAACAGTAAGTTAAAGACTCGTCACGAACAGTCTGACCGTGGGTCGGCTCGCGACGCGCTGGTGGCGACGCTGTGTAGAGACCTGCCGGATGTCACAGGAGACGTACTGAGATGTGCTATGAACGAAGCCGAGCCCAGGAACTCGACGTTAATGTGTAGTGAACGGTTACCAGAGGTACCGGAGAACAAAGACAAGTTCGTGTACATCTGTTCAACTGCTGTTGCTCTTTCAGTATACAAGAACACAACACACCCTCTCGAATCTCCGCCTAAGAAGTTCGCCGCCGAAGATATATCTCCCAGGAAGTCTTACAAGGATAACGT
TCGTGCGAGGCGTTCCTTATCCATATCAGCGAACAGCAGTCTGTCAGAGGATCCGAACACGTCCATAGAGAGTATAGCGGATATCCCGGTGACGCTCATCAACCCGAGATACACGGGCGAGAGATACATGAGAGATACTGAGGAAGAGAGAGATAGAGATACTAGGAACAAGATACACGTTGAAACCGAGAAGAGAGATACTAGCTATGTACCGAACACACCTAAAGGGAGACGAGCTATCAGGAGCGATGGAGATAATACACCACTGAGCGGAAGTCGGTCAAATACGCCGGACCGATGTGATTCACCCATCATCACGCCGAAGCGAGTTACCAGAAGTACCAGGAGTCGATCTCGTACGCCAGAAATAAGTCCAAAATCATCTCTCACGCCCATAGAGGAGCTGCCGAGACAGGAAACTGACACCGCGTCATACAAATCTAAATACATACCATCGCCTAGGGAGCATTTTAATATATTGATTTTACTTCATAGAAGGTCCCGTACGCCCGAGAGGATAGAAAAAGTAATAGAGCCTCCGCGTCTTGAAGCAATCAGTGAATCACCCACCAAGTCATCACAGCCGCCACAATCACCCACCAGACGTAGTCTCAGAAGCCGTTCCCGTACGCCGGAAGTGGAAATTAAAGTTGATCCGCCTGTTATAACAAGTCCTCGTAGCCTAAGAAGCCGATCGAAAACACCCGAAAAGTTGATGTCACCGAAGAAAGATCACGGAACCCGGAAGAAGTCGCTCTCGAGAATAGTTCTAGAAGCTAACGCGTTCGCTAAAACTAAGCAGATAGAAAAAATGGACGAGGAAGATAAACCGGATGCAACCGGTGTTATAGAGTGTACTCCTGTGAAGCCTTCCAAACACACCGAACCGCCCCATCCCTGTCTCATGGACGTTGAATTCTCGCCAATAGTCAACAAATCAATCCTACACAGTTCATCCGAGAGCTTCTCGATCACAGAGAAAATATCAAAATCGAGCGAAGATCAAATAGAAATTAATCAACTACCCGCATTCACTATCAACGAGATATACACCGACAAGTCGGTTCTGCACAGCTATCAGAGCAGTATCGGCGGCACAGAGTCGATACACGACACACGGGAGAGTATTAAGGAAACAGTCGAAATCTGCAAACCCCTCCCAGCGTTCACGAGCATTACTGATGACTTCGGCAAATCGGTGCTGCATAGCTTCGAAAGCACCATCGACTCTAGTAATATAGAAAGAAAGGAAACGCCTAAATCAAAGACAACGGATTATGGAGTTCTGAGTACCGACACGAGTGCGAGTGTCGGTCAACAAAACGAAAGCCTCATGACCAGTGACAGTGAAATCGACGATAGAGAATGGAGTCGACTCGACAATCAGAACGCGAGAGTTATACAGAAGGAGAAGGAGAGGATAGTGGAGATAGAGAGAGAGATAAACGAGATAGAAGAGATGTCGGATGAGTGTCGCGAGACTTGTAGTGATTCTAACGACAGTGCTGAGAGCGAAGAGGAAGAAGAGGGCTCCGAGTCAGAAAATGACAACGAGGTTATTTCCATCGAAGATTCCGACGAGTCATCGGATACATACAAGTTGGAAATTGAAGAAGACAATGTTATAGAACGAAACCAACAAGAACAGCTTGTACAGGCTACTGTCGGGACAGCTAGAGATGACGTCGCCGAACCCCAAGCATCCGGTTCCGGTATCCAGGAAAATAATGAAGCAGATTCAGAATTATTCCAAGCATCACAAGAAAAACCTGATGCACTCAACCCAGCGCGTGATATATCGATTCTAACAGATGATAACTCTGTCACAGAAACAGAAGGCGCTGATAAACTTCACTACTCCGATGAAATCAAAAATGACGATCAAAGCGTTGCCATGGAGACAGATCAGCCTGTGATAGCATCCGTGACTGTAGAACAGATAACGACTATTGAACAGAATATAGAACAAGAAATTCCCTCCAGAGAAGTAGATGTTATAG
ATTCGGGTCCCTCTAGGAGAGAAGATCACAAAGTAGAAGAAAGTGAAACTGTCAAAACAACTGAGGCACAGATGGAAAGTCAAGCAAAGTGTGTCCCAGATATAAAAGATAAAGATATCCCTGAAGAAATAGAAAATAGTGTCGTGGCCCAAGAAGAAAGCGACAAAGTTGAGATAAAAACACAGACAGAGGACATTGAGAGAACAGAAGACACGGAGCAAGTTCCGAAAACAAGAAAACGCACGAAATCTACGACCTCGACCAGGTCTAAAGAAGGTTCAGAGGTTAATGAGAAGGTGGAACCGAGAACGCCTCGCAGGCGACGGCAGTCAGCGAGCAAAGCAGAAGTCGAAGGAACTCCCGACAATAATGAACACACGCCCAGAACACGAGCCAAGACACCCAGCTCTGAAGTGAGAAAAATATTAACACGCCGCGCCTCCAAGGAGCTGGAGAAAACGGACGAACATAGACAGGAAGAAGTGCCAGAGCTAACGCCGAGGAGATCCACACGGAGGACTAAGAAAGAGGACGATAACGCTAGTGTTACCTCGGATTCCTCTGTCAAATCCGGCAGAAGCAGAGCTAGCGAAGACGGGAAACCAGCGAGGAAAGGAAGGAAGTCGGTGATGAACGTTAAACCGGATCTGACAGTCATACCGGAAGTCGCCGCCGAGGAGTCGAAGGCTTCAGATGATATCATCAAGGAATACTCGAGTGCTAGACGGTTGACGCGCAATCAGAAGGCCGTGTTGGACTCGTGGTTGGAGCCGGAGCTGCCTCGGAGGAAGAACGACAACGACTCGGATTCCAGCCTAACGAGGACCGACTACGACGGCTCCCCCGAACCAGATCAAGACTCGGACTTAGAGACCGCCGTCAAACCACAGCGCTTAGCAAGAGCTGCCTCTGAGACGAAAACAACTCCGAAGGCTGCGAGGATCGGTAGAAGGGTTTCAGTGGATATAGAATTTGAGACCATCCAATTATACGTGAATTTTGAGAACGGTTTTAGGATGCCAAAATACAAGTTTGTATGTGGTGATATACTGGGTGTGACTCCTGACGAGGGTTCTCCTGTTTCTGGTCTTTCCCGCGGACGTCGCGCCTCCTTTACGCGAGCCTGTGAAGCTCTTCACACACCCAGAAGGGGTTCCACCGATGTTAAGGGTTCCCCTATGTCAGAGGTCGAGTCGACCCCACGGAGAGGTCGACGACCTTCAGCTACGAGCGTCAAGGGAGATGTTAACACGCCCAGGTCGAAAAAAGCTGCGGAAAAAAATAAAGAAACAAACGAATAA

Protein sequence:

>DPOGS214789-PA
MSTARCPEISRGSCVTLDLRLDSESVTPYSHATRLEEHFFPNSLQYNCVALSTWSTAVVQTAGVQRQLVRGMSAAGPACLLAPARLYAACKTAGLTPLYAPSGDTPEDQRRFLLSVALEARLSSFLKRCAHDWATGTHSGVGCTLPFLVDWSWSRAIELKENAKELTAPLFTSSMMPDRNVIRCLEHCVQQLSQLTGLLDAILTKCCNLVVPDALSEMEEKYKGIGTVSLYFQVVQWFVRVGLLPEKSSDRHSHTLPYPVHQLCGIYNKRRIKLNRLQDKSDDESSNESCSLLYIDQLIEHEFGGDRVHQLWMVCGSSGGLYPPPSLFSLLRLYLLPDVPEEHKHSLLLYLLLDYSMIYDDMRHESVIRRLMQFPTMFGLSNTAIKATQAFWHLDHRDFDFALDQLQCLTGNTLSDWQHHVVLSSLLAQKKTQSALQYLHVRKPAPIHVSDNNDYDKLDDWQTSCNLYLARGLVFEALDVIRMCVENAGSSDDKTQLLNYFYKGCRNSGQLAKVLQVTLLPFEEEVFIRYLKECNESHTSDILVMYYLQQARYLEAEQYNSKLKTRHEQSDRGSARDALVATLCRDLPDVTGDVLRCAMNEAEPRNSTLMCSERLPEVPENKDKFVYICSTAVALSVYKNTTHPLESPPKKFAAEDISPRKSYKDNVRARRSLSISANSSLSEDPNTSIESIADIPVTLINPRYTGERYMRDTEEERDRDTRNKIHVETEKRDTSYVPNTPKGRRAIRSDGDNTPLSGSRSNTPDRCDSPIITPKRVTRSTRSRSRTPEISPKSSLTPIEELPRQETDTASYKSKYIPSPREHFNILILLHRRSRTPERIEKVIEPPRLEAISESPTKSSQPPQSPTRRSLRSRSRTPEVEIKVDPPVITSPRSLRSRSKTPEKLMSPKKDHGTRKKSLSRIVLEANAFAKTKQIEKMDEEDKPDATGVIECTPVKPSKHTEPPHPCLMDVEFSPIVNKSILHSSSESFSITEKISKSSEDQIEINQLPAFTINEIYTDKSVLHSYQSSIGGTESIHDTRESIKETVEICKPLPAFTSITDDFGKSVLHSFESTIDSSNIERKETPKSKTTDYGVLSTDTSASVGQQNESLMTSDSEIDDREWSRLDNQNARVIQKEKERIVEIEREINEIEEMSDECRETCSDSNDSAESEEEEEGSESENDNEVISIEDSDESSDTYKLEIEEDNVIERNQQEQLVQATVGTARDDVAEPQASGSGIQENNEADSELFQASQEKPDALNPARDISILTDDNSVTETEGADKLHYSDEIKNDDQSVAMETDQPVIASVTVEQITTIEQNIEQEIPSREVDVIDSGPSRREDHKVEESETVKTTEAQMESQAKCVPDIKDKDIPEEIENSVVAQEESDKVEIKTQTEDIERTEDTEQVPKTRKRTKSTTSTRSKEGSEVNEKVEPRTPRRRRQSASKAEVEGTPDNNEHTPRTRAKTPSSEVRKILTRRASKELEKTDEHRQEEVPELTPRRSTRRTKKEDDNASVTSDSSVKSGRSRASEDGKPARKGRKSVMNVKPDLTVIPEVAAEESKASDDIIKEYSSARRLTRNQKAVLDSWLEPELPRRKNDNDSDSSLTRTDYDGSPEPDQDSDLETAVKPQRLARAASETKTTPKAARIGRRVSVDIEFETIQLYVNFENGFRMPKYKFVCGDILGVTPDEGSPVSGLSRGRRASFTRACEALHTPRRGSTDVKGSPMSEVESTPRRGRRPSATSVKGDVNTPRSKKAAEKNKETNE-
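
Length check:

The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes that the transcript consists solely of coding sequence ending in a stop codon (the nucleotide sequence starts with ATG and ends with TAA), and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
def coding_length_matches(transcript_bp: int, protein_aa: int,
                          has_stop: bool = True) -> bool:
    """Return True if the transcript length equals 3 nt per codon.

    Assumption: the transcript is coding sequence only; when has_stop
    is True, one extra codon is counted for the stop codon.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record above.
print(coding_length_matches(5301, 1766))  # True: (1766 + 1) * 3 == 5301
print(span_length(726083, 742603))        # 16521
```

Under these assumptions the 5301 bp transcript accounts exactly for the 1766 aa protein plus a stop codon, and the gene spans 16521 bp on the scaffold. The gap between that span and the transcript length would correspond to intronic sequence.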