Monarch geneset OGS2.0

DPOGS200485
TranscriptDPOGS200485-TA3690 bp
ProteinDPOGS200485-PA1229 aa
Genomic positionDPSCF300158 - 130318-134456
RNAseq coverage37x (Rank: top 73%)
Annotation
HeliconiusHMEL0043854e-1932.96% 
Bombyx% 
Drosophila% 
EBI UniRef50%
NCBI RefSeq%
NCBI nr blastp%
NCBI nr blastxgi|1234386774e-2620.43%viral A-type inclusion protein [Trichomonas vaginalis G3]
Group
KEGG pathway 
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200485-TA
ATGTCTTTCGATCTCATCATGGAGCCGAATAAAGATGAACGAGTCAGAAAATCGCACAGAGGACTGTTACAAATGTTTAGACCTGGAGGATGTCTCAGCATTCACAACGATGAAGAAGAATACTCCTATATGCCGGGCACTTCTAACGAATTAAATAGAAGCTCATCTGGACAGGAAGACTTCCATGTTAAAACTACGAATGAGGCATTTTCTTTAAAAAAGAATGTTTTTTCTCCCTCTTACACGACTACATTTAAAACACCTAAGCCATTTTTGGGTAAATGCAAACCTGGTGGCTGCCTTGATCCACCGTTTGGTGAAGAGAAGTACATTTACAAACCTACGTTCGAGGACAACCAAATTGCAGAGAAATCTCCACACACAAATGATATAAATAAAATGCTGCTAGATTCAGATAAAAATAAACCGGAATTAGGAATAAAAAAACAAACTTTGGATCAAGAAGTAGAAGATTTGCATGAATCTTACACCGATTTGACCGAGAATATTGAAGATTCAGAATCTCTTGTTACTTACCCATTACCAGTATCATCTAGTTATATTAAAAATAGGGACAGTCAAAGTAAAAAGGACAGATCTAGAATCAACTCTGAACAAACATCTCAAAAATTAACCAATGAATTTCCTGTTTCATTTTTTATGCCTCCTAGAAAAGAGAAAACAAAACAATTTAAGACCTATGATGATTACAATGACAAGGAAACTAAGGTAAAAGAGGCATATATTAACAATGAGTATGCAATGCAAAGTCAAACACCTAGCGAAATGTTCTCGGATTCATTGACAACGACACATCCATCGCGTGAAGAACCTTCACAAGACGTAACACGGGAACAAGAAACATCTGTGAAATCGATACATTCATCGAGAAATGATGATATTTCTAAGGAACCAGTAAAATCTGAAGGTTTAGATTATATTTCTGATATCAAAGCTTTAAGAAAAGGTGAGGAAATATTTTCACAGGAACAAGTTACTGCAAACGATAAATCCGGGACAGGAAATCAGAACATACGATCTAAGTTGCCAGAAAGTCATATTGATACATCGTATCAATCTGACAAAATGAATTCAATACAAAAGATGGAAGTAGAAAACATAGTGAGTGAGGATAAAAATATAGAACCTGAAAAAAAACACAGTTCAGCAAATGTTAAGTTAAGCAGTGAAAAGTTAAAAATAAACTCACTTTCTGAATTAAACAATCATTCAGATAATTCAGAAATAAATAGTATTGCTGACAAACCTGTGGAAGGAGAAACTGGATTACGCAACGGAATTAATAATCTTTTAGACCCAATTGAAAAAGAGAAATTTAAAAAGAATTCTTTAAAATATGAAACTTCAAAAGATTTTGGGGTAAACGTTGGGAAGCATGATTCAACAAACCTAATCAAGCCTAATTTAAAATTCAGCCATGAGGAATTAACTAAAACATCAATTGATGATCCAACCAAAATTTTAACATACTCAAGTAGTGAAAGCATTAATGATAAAGATATTAAAATAGAAAATCAATTACGTGATGGAGTTGAAAGTAATTTAGATCCAACGCAAAAGGAAATGTTTGAAAAAGATTCTCTTAAATACGAAGCTTTAAGACATTCTAAGGAAATAATTACAAATGATGACACTATAAAAGTAAGAAAATCTAATGTAAAATTAAGCAAGGAAGATTTAGCTGAAATATCGATGGATCAATTAAACAGGTTTTCAAAAGATCTAAAGAATGAAAGTATAACTGACATCAACGTTAAAAGAAAATCTCAATTGCCTAATAAAACGAGTAGCATTTCAGATCCAATGCAAAAAGATGCTTTAAAATACGAAGCTGTAATAGATTCTGGCCCAAAAATCACAAATGAAGACCCAATACAAATCGAGAAAACTGATGTAAAGTTAGGCCAAGAGGAAATAACAAAAATTTCAATTGATCAATTAACCAAGCTCATTGACCAGGCAGAAAATGAAATTATTTCTGACACAAGTGCTAAAAGTGAAATCCGATTAACTAATGAATTGAGTAACTTGTTACACCCAGCCACTAAAGAAGTATTTGAAAAAGACTCTTTAACATACAAAGATTTAAAAGATTCAAGTAAAATAACTTTACAAAGCAGCGACGGGGATGGCATTAGTAATATTAAAAATTTTGTTCCCTTACAAAAGAAATCATTGGATTCTAACGACAATGCTGAAGTAAATTTAAAAGAACCAACTAAGAACCTAGACGAAATTATAACGTTGCCAACAGAAAAATTAGTATTTCAGAATAAAACAAGTGTAAATGAAGACTTTGAGAGACATTCTGAAAGTTCCAATAAAATAAATAATAAAGTCTCAGATATCGCAATAAAATCATTGTCATCTAAAATATCACTTAATGAGAAAACTGCTGACTTAAATTTGGAAAATCTTAGTAAGAAAAGCTCAAAATATATCGTATTTGAAAAACCACTAAGTGCTGGTTCTGATGATTTGAAAGAAACATCTCTTGAAAATAAATCATCGACATTTAAAGACTTACCAACAAACAATATCGATTCTAAAAGTATGTTAATATCTTACGAAGAAAATCACGACATAGATAATGATAAAAATGGAAGTCTTCGTAACATATCATTAAAACAAGAAATAAATCAATCAAGTTCTCATCATTCAAGAGACAGTCAATTAAGTCTTTTAAAAAAGCTCAGTTCTCTACCACAAATTCAAAATTCAGAAGGCATACTGTACGATAATGTTCAAGATAAAAATACATTCGACAACTCCTCTCATGAATTTACTGATTCTAATATTTTACGAACTGGCCAATACTTGAATAATACTCCAACAAAAATAAAAGACGGCGATAGAGAAGGGATTGATTTAAAACAAAATATCTTACTTCATAAAGGTCTCGACAATAATTTGGATGAATGGATAAGGAAGGATGAAAAGTCCTTTCAAAATTATGTTACAAATAATGTTAACGAAGATATTTTAAGTACATCTCAAACCAATGATTTCATTGAAGACAGCACTCAATTAGATAAAGGTCCGAGAGAAACAGTCGATCATGAATTAATGTCCCTCAAATCTGGTAATAGGGACCAAGGCATTAGTAAAGATTATGCAAAATCACAAGTTTATGACTCGTCAAGTGACATAAAACACACAAGCTCTTCTATATTCCTCAATTTTGCCTTCCACTCGTTAGCAAACGAATTTGGCTACACCATTACAACATTAGATCCTACAACAGATTTAACTGTATACCCACAAAATAAAATAACTACCATTAAAACTGCTCTTAAAGAAAATGATAAAGAAATTAGAATAAGGATGGATTCCGAAACCGATATTATAATTCAAATAAAAAGGAACCATAAGCGTAACGAAGGGTCTAAAAGCATCGTATCTAATGAGGGAAGGAATTTGATGCGTGGATACTGTTCAGAAACAATAATCAATAAAGATCAATTTTTTAAAAATACTCTTAAAACAGTTTATGACACGTTGGTGCCGATAGAAAAAATAATATCCAATCTTAAGGAAGAAGCAGATGTGCTATATCGGGAACAATTATTACTGAGAAAAATTTTGTCATCGAGGGAAATGAAGTCTAAAAGAATTATCCGCACTAATAAAAATTGTAGCTGCCTAGAAAAGGAAATGGGAATCAAGTGA

Protein sequence:

>DPOGS200485-PA
MSFDLIMEPNKDERVRKSHRGLLQMFRPGGCLSIHNDEEEYSYMPGTSNELNRSSSGQEDFHVKTTNEAFSLKKNVFSPSYTTTFKTPKPFLGKCKPGGCLDPPFGEEKYIYKPTFEDNQIAEKSPHTNDINKMLLDSDKNKPELGIKKQTLDQEVEDLHESYTDLTENIEDSESLVTYPLPVSSSYIKNRDSQSKKDRSRINSEQTSQKLTNEFPVSFFMPPRKEKTKQFKTYDDYNDKETKVKEAYINNEYAMQSQTPSEMFSDSLTTTHPSREEPSQDVTREQETSVKSIHSSRNDDISKEPVKSEGLDYISDIKALRKGEEIFSQEQVTANDKSGTGNQNIRSKLPESHIDTSYQSDKMNSIQKMEVENIVSEDKNIEPEKKHSSANVKLSSEKLKINSLSELNNHSDNSEINSIADKPVEGETGLRNGINNLLDPIEKEKFKKNSLKYETSKDFGVNVGKHDSTNLIKPNLKFSHEELTKTSIDDPTKILTYSSSESINDKDIKIENQLRDGVESNLDPTQKEMFEKDSLKYEALRHSKEIITNDDTIKVRKSNVKLSKEDLAEISMDQLNRFSKDLKNESITDINVKRKSQLPNKTSSISDPMQKDALKYEAVIDSGPKITNEDPIQIEKTDVKLGQEEITKISIDQLTKLIDQAENEIISDTSAKSEIRLTNELSNLLHPATKEVFEKDSLTYKDLKDSSKITLQSSDGDGISNIKNFVPLQKKSLDSNDNAEVNLKEPTKNLDEIITLPTEKLVFQNKTSVNEDFERHSESSNKINNKVSDIAIKSLSSKISLNEKTADLNLENLSKKSSKYIVFEKPLSAGSDDLKETSLENKSSTFKDLPTNNIDSKSMLISYEENHDIDNDKNGSLRNISLKQEINQSSSHHSRDSQLSLLKKLSSLPQIQNSEGILYDNVQDKNTFDNSSHEFTDSNILRTGQYLNNTPTKIKDGDREGIDLKQNILLHKGLDNNLDEWIRKDEKSFQNYVTNNVNEDILSTSQTNDFIEDSTQLDKGPRETVDHELMSLKSGNRDQGISKDYAKSQVYDSSSDIKHTSSSIFLNFAFHSLANEFGYTITTLDPTTDLTVYPQNKITTIKTALKENDKEIRIRMDSETDIIIQIKRNHKRNEGSKSIVSNEGRNLMRGYCSETIINKDQFFKNTLKTVYDTLVPIEKIISNLKEEADVLYREQLLLRKILSSREMKSKRIIRTNKNCSCLEKEMGIK-