Monarch geneset OGS2.0

DPOGS208706
TranscriptDPOGS208706-TA4053 bp
ProteinDPOGS208706-PA1350 aa
Genomic positionDPSCF300043 - 237593-249283
RNAseq coverage296x (Rank: top 38%)
Annotation
HeliconiusHMEL0152480.056.14% 
BombyxBGIBMGA003350-TA0.049.55% 
DrosophilaCG7065-PA4e-2831.35% 
EBI UniRef50UniRef50_Q16Y012e-7433.56%Putative uncharacterized protein n=1 Tax=Aedes aegypti RepID=Q16Y01_AEDAE
NCBI RefSeqXP_001659440.13e-7533.56%hypothetical protein AaeL_AAEL008714 [Aedes aegypti]
NCBI nr blastpgi|1571195936e-7433.56%hypothetical protein AaeL_AAEL008714 [Aedes aegypti]
NCBI nr blastxgi|1973137763e-13032.86%uncharacterized protein CG7065 homolog [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL26689 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208706-TA
ATGGATCTTCCGGACTGTCCTCCGGGAGCCGAGGGTTACGAGGATACAGTCACTGAAGTACCTAAAAAGATTGATTCGGAAGACGCAGCGGCCTGTACTAAGGCGGAGCAGGAGTTCCAGCAGCATCTGAACAGTCTCAAGATCAAAACCGAGGATGGCAACTACCGCTGCATGGTGGAGAGACGCGGAAGAGATGTCTCAGGGAAGTGGATATACTTGTGCTATCCATGTGCAGCCATGTGCAGTGGGGAGAGGATCCTGCAGACTCATATATCAGGGAAGAAGCATAAAGCTAAGCTGTCAATGAGAAATGTTTGGCCATTGAGTATATTCAATAACCATCCATATGTGTCGAATTCAAACTCCAAGTCTGCAGCTACTGAGACGGTGCTGCAAAAGATGGCGGAGGAAGTGGAGATGTTCAAGAAGAATCCCTCCGAGTTGGATCTCAAGTATGATAAGTACCGGGAAGTGAGGTGTCATATACAGGACACACTTGATGCTGTCAAGGCTCCACTGCTAGGTATCGAGTATCTGATCGAGCATCCCCCCGAGCAGGCTCACTATGAACCTTCATACATGTGTACTCTGTGCGGCAAGCAGGGTCACCCCCGGACTATTGTCAATCATCTAACATGCTTCTGGCATCGATACAACTATCTACTTCGTCACTTTAACAAGGCCTGCGCCGCGCTGACTCCGTACCGGGCTCAGGCCAAGTACCGCGAGGGTGTCGCCATCATCATGAACAGGCTGGCGCAGCGCATCCAGGACAAGTACGGGCGACTGAGACCCGTGAACCTCGACAAGGAGGACTACGAGAAGGAGAGAGACCAAATACATCAGTGGATCTTCCGCGGCTACCACTTCACGGAGAAAGATAGCTGTACTTTCGAAGAAGTTGTGGACGTTGACTTGATTACGTCTCTAGATTCTACCAAGACTGCAGGAGGGAGAATCACATCTAACAGGGAGCCGTCGCCTCCAGTCGTAGCGGCGCCGTCTAAACCTTTCGGTTCTAAACGTAACCCTCGACGTCGCGGCTCAATGGAGTCGCTGTCGGACGTGAGCGACGAGCCTGACATAAGAAGCAACAAGGATGACAAAATGTTCAAAGGACGAGGCGAACCGCCGCGGTACGAGCCTTACGGGTCCCGCAGGAGAACCAGTCCTTATCCGGAAAAGGGTTCAACGTCTCGGCCCCACAACTACTCGTACAAGGTGAAGCTGGCTGACGAGAAGTGTGCTGCGGCCGAGCAGGCGGCCAGGAGGGCCAGGGAGTACCACGAGAAGAACCCGGAGAAACATCCGCTGTACCCGGACGAGTGGAGGAAGTTCTGGAACCGGAGATACAAGGAGATACAGGCCGAAGGCAAAGATCCATCGAAATACGACTTCAAGCCGGAGTGGATTGTGTACTGGACGGGGAGGATGAAGGAGCTGCACGAGGAGGAGCTCAAGACTACGGTCCTGGAAATATACAGGAGGTTGAGACTCACACCGCCTGATGCAAGAGAGAAACGTCGTTCGTCTGACCGCCGCAAATCCTCGGAACATCGTCGGTCCGCTGATAGAAAGAGATCCCTCGATAGAAGGAGGTCAGCTGAGAGGAGACGATCAGCTGACCGCAGGAGGTCGGCTGAGAGGAGGCCGTCCGCGGACAGCAGACACGCCGCGCCCGCCGCTAGCAGGCACTCGCCCTACAGGCGGACCCCGGAACATAGGAGGAAGTCCCCGGACCGCAGAGATCGCAGGTCCCCAGGTCACACTCGCACCGCACCGCGTCGCTCCCCTCTGAGGCATCTTCGCACACGATCCAGGAGTCCTATACATAAAGGGAGCAGTGTTCGTCGTCGCTCGCCGCTGTCTCGCCGCGGGTCGGTGCCTCGCAACCACAGCCCCTCCCGCGATCAACCCTCCATGCAGACCGTCCTTATCTCGGACGATGAACTTAAACCGGACGACGGTCTCTCTCCTTGGAACTCGGCGGAGTCCCTGGGTTCCCTTCCGGAGGCGAGGTCCCCGGTCCGTCGCTCGGCCTCCACAGGCGTATCTAAATCGTCTCGCAGACAAGATTTCCACAAACAGGACTACGACGCTGAAAATGTAGTTGCAACTTTGAGACTGCTAGTAGCTCTAGAAGACTACCTCGGCAGCTTGGGGCCTAAGATCGTAGATTTGTTGGCAGATGCGCTTAAGATGGAAAAGGACAAGGCAAATTCGTCCGAGGAGTTGCTCGACAACGAGACGGCGGTAGTGTTGATCGAAACGGCCAAAGAAAAATTGAAGGGAGCGGCTCAGGCCGGACTAGTCACTGGCAGCGCCGCCGCCGCCGTCAGGACAGCCGTAGTGAGGGCAGCCGCGACCCTACACGCAGCCGACAATAGACTGAAGAAGAAGAAGGAAAGCAAGGAGTGTTCCGGCGGCGGCGGCGTCCCGGTGGTGGGCGTGGGTGAGGTGGACCGAGCACAGATCGCTAAACAAATGGCAGCGGCTCTAGTGGCTCAGGGAAAAACAGACGTCTCCTCGGAGGAATTGGCTCAGCTTGTTGATGCTGTTGTGGGTATGGCCGAAGCGAAAAAACGCGAAGCGGAATCTAAAAAGAAGGCTGAAGCACGAGCCAGCAACAATCAGACGGCGCGACAATCGCTGGCGGCCTCGGGGACGACATCCGCGCTGAAGATGCTACAGTTCGCCTACGATGATAAAAAGACTGACAAAGAAGACGTTCCGGACGTGATGGACGGCCTGTCGGACTCAGATCTGGAGACTCTCCTTAAAAACTTCAACGAGTTATCGGCAGAGGAACAACATAGCCTCATAGCCTATCTTAAGAAGCTTGAAGCTCGAGAGCCGCAACGCGTGGAACGATTGCGACAGTATGTGAGCGCCGCGGCGACACACGTGCACGGGGACGCTGAGGACAAACCTAACGCCAAGGAACCGACTGTGGCCGTCGAAAGCGACGACGACGACTACACTGTGGAAGATGTGTTCAAATCGGCGACGCAAAAGGTAAAAGAAGATCAGATCCGCCAAGAAATGGAAATTGTGAAAAAATCATTGGAAGAGACCAAGGAATCTTGTGTTTTACTAGACTCGCCTCCAGCTAACTCGTCAGTTCCTAATATTATGAATAGCTTCTCATCAGCGACCGACCTCTTGGCCCTGGTTCAGGCTACGTTACAATCGACACCCGCACAAAACCCGGCCGTCGGTCAAGTGACGTCAGACGTAGTTATGAGTAGTACACAACCCAGGTCCTTCGGCGACCTGCCTGAATCTTTGAAACCTCAACCGCACTTACTTCCAACAGCAAATAAACAAACTCCTTTCATTCAGCAAAGTATGTCACAAATTTCTGCCAATGTAAGTAACATTGTTAATCAAAATAGTTTCCACAAAGCCAACCTTCCGTCTTGGGAGACTGGACAAGAGGCAATTTTAGATAGAGCCAATATACAAGAATTTGATAAAATTGACAACCAACAAGGAAGGGGTTATCAAGATAACTTTCATGACATATCCAGAGGATCCCAGGATAATTATTATCAAGGAAGTAGGAATCAGGATGGTAATTACTATCAGAAAGAACCAGAGTGTAATATTAATACAGGCACAAGGCTCGCGTACAACCAACAAAACAGTTATAATCAGACTTCAAAAGCAATGCACAATAATATGAATCAGGGACAGAATAATTTTAATCAAATACCTCAAGGGTCACAAGACAGTTATAATAATTATAATACAAATCCCAGCAGCTACAGCCAAAATAGAAGTCAAGGAAATTACAACCAAAACGCCAGAGAAAATCAGAATTCCTTTAACCAATTAGGTAGATTTGATAATAATTACAGTAACCCTGCACCGCGGAGTCCCATGGACAATTATAACTTAGGCCGAGGAGGTCAAGGCAACTCCAGAGGTCAGCAGGACGGCTCGTTTAAAAACCTAGGGCCGGAGGGATCCAGGCCGAGGCAGTCCGCTACTCGAGCCGGATCCAAACCCTCGATGGGTTCGCATATACGAGATATGACGGGGTGTAATTGA

Protein sequence:

>DPOGS208706-PA
MDLPDCPPGAEGYEDTVTEVPKKIDSEDAAACTKAEQEFQQHLNSLKIKTEDGNYRCMVERRGRDVSGKWIYLCYPCAAMCSGERILQTHISGKKHKAKLSMRNVWPLSIFNNHPYVSNSNSKSAATETVLQKMAEEVEMFKKNPSELDLKYDKYREVRCHIQDTLDAVKAPLLGIEYLIEHPPEQAHYEPSYMCTLCGKQGHPRTIVNHLTCFWHRYNYLLRHFNKACAALTPYRAQAKYREGVAIIMNRLAQRIQDKYGRLRPVNLDKEDYEKERDQIHQWIFRGYHFTEKDSCTFEEVVDVDLITSLDSTKTAGGRITSNREPSPPVVAAPSKPFGSKRNPRRRGSMESLSDVSDEPDIRSNKDDKMFKGRGEPPRYEPYGSRRRTSPYPEKGSTSRPHNYSYKVKLADEKCAAAEQAARRAREYHEKNPEKHPLYPDEWRKFWNRRYKEIQAEGKDPSKYDFKPEWIVYWTGRMKELHEEELKTTVLEIYRRLRLTPPDAREKRRSSDRRKSSEHRRSADRKRSLDRRRSAERRRSADRRRSAERRPSADSRHAAPAASRHSPYRRTPEHRRKSPDRRDRRSPGHTRTAPRRSPLRHLRTRSRSPIHKGSSVRRRSPLSRRGSVPRNHSPSRDQPSMQTVLISDDELKPDDGLSPWNSAESLGSLPEARSPVRRSASTGVSKSSRRQDFHKQDYDAENVVATLRLLVALEDYLGSLGPKIVDLLADALKMEKDKANSSEELLDNETAVVLIETAKEKLKGAAQAGLVTGSAAAAVRTAVVRAAATLHAADNRLKKKKESKECSGGGGVPVVGVGEVDRAQIAKQMAAALVAQGKTDVSSEELAQLVDAVVGMAEAKKREAESKKKAEARASNNQTARQSLAASGTTSALKMLQFAYDDKKTDKEDVPDVMDGLSDSDLETLLKNFNELSAEEQHSLIAYLKKLEAREPQRVERLRQYVSAAATHVHGDAEDKPNAKEPTVAVESDDDDYTVEDVFKSATQKVKEDQIRQEMEIVKKSLEETKESCVLLDSPPANSSVPNIMNSFSSATDLLALVQATLQSTPAQNPAVGQVTSDVVMSSTQPRSFGDLPESLKPQPHLLPTANKQTPFIQQSMSQISANVSNIVNQNSFHKANLPSWETGQEAILDRANIQEFDKIDNQQGRGYQDNFHDISRGSQDNYYQGSRNQDGNYYQKEPECNINTGTRLAYNQQNSYNQTSKAMHNNMNQGQNNFNQIPQGSQDSYNNYNTNPSSYSQNRSQGNYNQNARENQNSFNQLGRFDNNYSNPAPRSPMDNYNLGRGGQGNSRGQQDGSFKNLGPEGSRPRQSATRAGSKPSMGSHIRDMTGCN-