Monarch geneset OGS2.0

DPOGS210275
TranscriptDPOGS210275-TA1923 bp
ProteinDPOGS210275-PA640 aa
Genomic positionDPSCF300216 + 130472-140416
RNAseq coverage147x (Rank: top 54%)
Annotation
HeliconiusHMEL0086221e-17765.89% 
BombyxBGIBMGA000028-TA2e-17160.29% 
DrosophilaCG8420-PA6e-9238.13% 
EBI UniRef50UniRef50_Q9VHH19e-9038.13%CG8420 n=5 Tax=melanogaster subgroup RepID=Q9VHH1_DROME
NCBI RefSeqXP_001649376.13e-9538.60%hypothetical protein AaeL_AAEL004564 [Aedes aegypti]
NCBI nr blastpgi|1571065575e-9438.60%hypothetical protein AaeL_AAEL004564 [Aedes aegypti]
NCBI nr blastxgi|1954993412e-9639.44%GE25931 [Drosophila yakuba]
Group
KEGG pathway 
Orthology groupMCL16079 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210275-TA
ATGAAAAAACCAGAAGAGTCCATCACAGTACTTAGCAATTATATCGCGAAGAAATATCTGGACGTGAACACTGACGTATCCAGCTACGTGAAGTCGGCCGTAGACTACGTGAAGCAAACGTTGAAAATGGCGCAATCGGCTTCCGAATCTTTGGCGTCGAGAGATTACAGCGCGTTGGCGGAGAGACTGACGGATACGTTAAACATGGACGTGATAGAACCAGTCCTAAGGGTGTACCGAGCATACACACACGCGAGGACGAACACACACTGCCAGGAACACCTCATGTGTCTCGTCAACAGACAAGTGAAGCAGGGAGGTCCAGGGTTCAAGGCGGGTCTTACTAAGATGAGCAGCCTGGCCGCCTGCGCCGCTCTCAGCTTTGAAAACGGAAGAGGTTTCTGGGATTTATACAACGCTGTGCAAGCGGATGTTGACTGTGAGGCCAAGTACCCCGCTGACTGTTCATCGTTCCATGAACACGAGCTGAAAGTTACAACTGAAGTCTACCACAAAATGGCGAAAGTACACGTATTAGTTTTAGTGGTATGCCAAATCCTGTGTCTGCCGACCCTGTGTTATAGTGATGATGATAATCCCTTCCTGGATCTCGCGTCCTCGTTCATACAGAACATGGCAAGCGACAGTGGGAAAGGAAACAATATGGATGGGTTGGCAGCCATAGGAAACATTGTGGGCAGTCTTATGCAAGGTGACAATGCTAAGAATTTGGGCTCATTATTTGGACAGGAAAATGGTGGAGCTGGTGACGTCCTTTCCGGTCTTGGAAGTCTGTTTGGGGGTCAGGATGGTAAGATAGATCCTGCGGTCATAGGATCTGTGGTCTCAATGTTCGCTTCCCAGATGGGTTCTAATCAGAACCAACGAAGAAAGAGAGAATCTGATACCAATGATATAAATTTGGACAGTATCCTTAGCATGGCGTCAGGATTTCTTGGTAACAAAAACGCAGCAGGAATGCTGCCATTAGTTATGAACGCTTTAAGCTCATTCTCCGAAGACGAAACATCGAAACGAGCTGATTCTCACAAAGACCACGCGTCTTTTCTGCCTCCGTTCCTCGAAAAGGCCCATCTATATTGGGACATCTTCATAAATTCTGAATTAGGCAAGGCAGTCTGGGAGAAATCCGGTTTCCAGCGAGCGATGAAGTCGTTTATGGGTCCCGACGGGAAAGTCAGCTTTGAACTGATGTTTAAAAACTTCGAGAATCATTCATTTAGGAGGCATTGGATCAAGGCTGTAGCGAAATACCTAACGGGTATGGTAGTTCATGTCTCGAAACCAGAAGTTTACCAGAGATATTTGTCGACGGTACAATACGTGCTGAACGGTTTCCTGAGCTCGCAAGGTCTACCTAAAAACACTCACTTCAATATGAAAAAACCAGAAGAGTCCATCACAGTACTTAGCAATTATATCGCGAAGAAATATCTGGACGTGAACACTGACGTATCCAGCTACGTGAAGTCGGCCGTAGACTACGTGAAGCAAACGTTGAAAATGGCGCAATCGGCTTCCGAATCTTTGGCGTCGAGAGATTACAGCGCGTTGGCGGAGAGACTGACGGATACGTTAAACATGGACGTGATAGAACCAGTCCTAAGGGTGTACCGAGCATACACACACGCGAGGACGAACACACACTGCCAGGAACACCTCATGTGTCTCGTCAACAGACAAGTGAAGCAGGGAGGTCCAGGGTTCAAGGCGGGTCTTACTAAGATGAGCAGCCTGGCCGCCTGCGCCGCTCTCAGCTTTGAAAACGGAAGAGGTTTCTGGGATTTATACAACGCTGTGCAAGCGGATGTTGACTGTGAGGCCAAGTACCCCGCTGACTGTTCATCGTTCCATGAACACGAGCTGAAAGTTACAACCGAAGTCTACCACAGTGAATTATAA

Protein sequence:

>DPOGS210275-PA
MKKPEESITVLSNYIAKKYLDVNTDVSSYVKSAVDYVKQTLKMAQSASESLASRDYSALAERLTDTLNMDVIEPVLRVYRAYTHARTNTHCQEHLMCLVNRQVKQGGPGFKAGLTKMSSLAACAALSFENGRGFWDLYNAVQADVDCEAKYPADCSSFHEHELKVTTEVYHKMAKVHVLVLVVCQILCLPTLCYSDDDNPFLDLASSFIQNMASDSGKGNNMDGLAAIGNIVGSLMQGDNAKNLGSLFGQENGGAGDVLSGLGSLFGGQDGKIDPAVIGSVVSMFASQMGSNQNQRRKRESDTNDINLDSILSMASGFLGNKNAAGMLPLVMNALSSFSEDETSKRADSHKDHASFLPPFLEKAHLYWDIFINSELGKAVWEKSGFQRAMKSFMGPDGKVSFELMFKNFENHSFRRHWIKAVAKYLTGMVVHVSKPEVYQRYLSTVQYVLNGFLSSQGLPKNTHFNMKKPEESITVLSNYIAKKYLDVNTDVSSYVKSAVDYVKQTLKMAQSASESLASRDYSALAERLTDTLNMDVIEPVLRVYRAYTHARTNTHCQEHLMCLVNRQVKQGGPGFKAGLTKMSSLAACAALSFENGRGFWDLYNAVQADVDCEAKYPADCSSFHEHELKVTTEVYHSEL-