Monarch geneset OGS2.0

DPOGS214113
Transcript: DPOGS214113-TA, 1788 bp
Protein: DPOGS214113-PA, 595 aa
Genomic position: DPSCF300014 - 1730057-1733972
RNAseq coverage: 36x (Rank: top 74%)
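
The lengths in this record can be cross-checked with simple arithmetic: a 595 aa protein plus one stop codon needs 3 x 596 = 1788 bp, which matches the transcript length. The sketch below shows that check. It assumes the transcript is coding sequence only (the FASTA record starts with ATG and ends with TAG) and that the genomic coordinates are an inclusive range; the function names are hypothetical and used only for illustration.

```python
def cds_matches_protein(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


def genomic_span(start: int, end: int) -> int:
    """Length of a coordinate range, assuming both ends are inclusive."""
    return end - start + 1


transcript_bp = 1788   # from the Transcript field
protein_aa = 595       # from the Protein field

print(cds_matches_protein(transcript_bp, protein_aa))  # True
span = genomic_span(1730057, 1733972)                  # from Genomic position
print(span)                                            # 3916
print(span - transcript_bp)                            # 2128
```

Under these assumptions the gene spans 3916 bp on the scaffold, 2128 bp more than the transcript, which would be consistent with intronic sequence between exons.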
Annotation
Heliconius: HMEL011400, E-value 0.0, 65.77%
Bombyx: BGIBMGA006163-TA, E-value 5e-45, 48.50%
Drosophila: (no hit listed)
EBI UniRef50: UniRef50_UPI0000E47141, E-value 2e-17, 36.31%, UPI0000E47141 related cluster n=1 Tax=unknown RepID=UPI0000E47141
NCBI RefSeq: XP_001181222.1, E-value 4e-18, 36.31%, PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
NCBI nr blastp: gi|115675767, E-value 7e-17, 36.31%, PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
NCBI nr blastx: gi|115675767, E-value 2e-16, 34.34%, PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
Group
KEGG pathway: (none listed)
Orthology group: MCL25213 (Lepidoptera specific)

Nucleotide sequence:

>DPOGS214113-TA
ATGCCTCCGAAAGCGAAAGAGTCTGCAACAAAAAAGCCGGTCCCTCAAGTTCTTCGTCCCAGTATTAGCGATTCCAGTCTAAATCAAGTACCCGATGATGCGATTGAAGGCTACCAAGAACGTATAGAAACCGGAATAGAATGGACTCTATTGGAAGATGCAGTCCGTCGACACAAACAGAAATGCACTAGAGATAACATTCCCGAGATAACATTTTCTAAAGCTTTTAAAAATAAAATTAAGCACAGCATTCTCAATGGATTCATGGAACCTGACGCCAATTTACAAGAACAATGGGAAGTTTTTGTACCTATGAAAATGTGGGATATCGAAAAGTTTGCTAATGACGAAGGAAAAATATGCTTTAAGAGCTCCAACAACCTCACCGATGACATGTCTAAATTAATTAAAAGATCAGTGCTGTTTGGTGATAAGAAAACACTTAGACGGGACCTTAAAAAGATCAACGTATTGAGAGTGACGGATTCTGAGATGACAGAATTAGATAAGTCATTGATGGAATTTGATATCCTCGTCACACTCAACTTATGTGGTAACTATCTCACCGAGATAGATGCTTGTACACTTCCGCAAGGATTACGGATGCTTGAATTACAGGCGAATCATATAAGCAATTTGGTGCCATTTGTTGAAAATTTACCTACCAATTTGATTTACCTAGGGCTAGCAAGGAATTTTTTGGACAACTCTAGTGTTGAAGGAATAAGCAAAGGGCCTCACTACCTTACTGTTTTGGATATCTCAGACAACGACATCTGTGATCTCGACATGGTACTAGATGCTTTATCGACTTTACCAAGTTTGACCGGACTACACCTTGCCGGGAATCCATGCTCGGTGTGCGCAGCGTATGCACGTTCAACACTGATAAAGCTACCCCGTCTTCAATGGCTAGACTATAGGGAGGTGTTAGTGACGGATCGTCCTTTGGAACCTGTGGAGCCTCACCCGGATGACTTGCGATCAGCTTATTTCACATTCACCGTCTTCAGAATTATATCAGCGCCTCAACCACCAAAGCCTGACAAGGGTGCTGTAACAGCTTTTCATGTAGAACTGGAACTGCCATTACTGGATTCTTCTAGAAGACAATTCTTGATGTTTAGAAATAACGAGTCCTTAATAGAAATGTTGCCACCTCCCGAGGACGAAGAATGGCCGTCAACAAAATTTTCTAGATCATTGATTAAATTAATTGAGAGTAAATCAGCAGTAGAACCTGAAACGTCCTCCCACGAATCAGACATTTATAACAGACTGACGACGAAAAACTCCCGTGAGATCATCCATTACACGATATTTGAGAGCAACAGAGTACAATGGAACAAACTCATGAATTTCCAAGAACCGACCATAAAAATCTTCTGTCCGAACCTCAGAGCTCTGAGAGATACATTTCGCACAACCATCACCCTGAGAGTCATCTATTCTGTGACAACCACTGGCAAACAAAGCAAGCCAGACAAGAAGAGCGCTCAGAATCTGAAGCCGCCGGGTGAGCAGCGTGTGACCCTCGCCACAGTGAAATGCTCCTTGAGAAAAGTGGACTGGAGTCAGCCCAGCCAACACTTCCACTGGGACGACTCGCTGGGGACCGACGAGGCTATACACTGGGGAGACGGGGACCTCTCTGTATTACAATATACTCTAGCGGCTGTGAAAACCACAAAAGGCAAACCTGATTCGGACCCCGGCTCCACCAAACAGTACCCACCAGATAACTTCACCTGTCACTTTGGATTCGGTATAGACACTTTGAAAGCGTAG

Protein sequence:

>DPOGS214113-PA
MPPKAKESATKKPVPQVLRPSISDSSLNQVPDDAIEGYQERIETGIEWTLLEDAVRRHKQKCTRDNIPEITFSKAFKNKIKHSILNGFMEPDANLQEQWEVFVPMKMWDIEKFANDEGKICFKSSNNLTDDMSKLIKRSVLFGDKKTLRRDLKKINVLRVTDSEMTELDKSLMEFDILVTLNLCGNYLTEIDACTLPQGLRMLELQANHISNLVPFVENLPTNLIYLGLARNFLDNSSVEGISKGPHYLTVLDISDNDICDLDMVLDALSTLPSLTGLHLAGNPCSVCAAYARSTLIKLPRLQWLDYREVLVTDRPLEPVEPHPDDLRSAYFTFTVFRIISAPQPPKPDKGAVTAFHVELELPLLDSSRRQFLMFRNNESLIEMLPPPEDEEWPSTKFSRSLIKLIESKSAVEPETSSHESDIYNRLTTKNSREIIHYTIFESNRVQWNKLMNFQEPTIKIFCPNLRALRDTFRTTITLRVIYSVTTTGKQSKPDKKSAQNLKPPGEQRVTLATVKCSLRKVDWSQPSQHFHWDDSLGTDEAIHWGDGDLSVLQYTLAAVKTTKGKPDSDPGSTKQYPPDNFTCHFGFGIDTLKA-