Monarch geneset OGS2.0

DPOGS204666
TranscriptDPOGS204666-TA3135 bp
ProteinDPOGS204666-PA1044 aa
Genomic positionDPSCF300170 - 356943-379915
RNAseq coverage36x (Rank: top 74%)
Annotation
HeliconiusHMEL0162502e-13255.98% 
BombyxBGIBMGA007468-TA2e-13855.20% 
Drosophilacutlet-PA1e-6132.10% 
EBI UniRef50UniRef50_D6WXA22e-6130.32%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WXA2_TRICA
NCBI RefSeqXP_973644.12e-6230.32%PREDICTED: similar to cutlet CG33122-PA [Tribolium castaneum]
NCBI nr blastpgi|910889415e-6130.32%PREDICTED: similar to cutlet CG33122-PA [Tribolium castaneum]
NCBI nr blastxgi|2700123641e-5730.77%hypothetical protein TcasGA2_TC006507 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL13092 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204666-TA
ATGGGGGATTTTCCAGACCCCGATGAAGAATATGAACTTATGTATTCCGATGATCTTGAACTCATTCGTGAGATTGATGATGGCACAGATTCAAAAATAGTCAAAAATAAGGTTTTGCCAGCGAAGAGAAGTCTAGATTTCTCAAGTCCAGTACAAAAACCTCAATTAAATAATAAACAAAATAGTGTAAATGAGTCAATTTTATCAGAAAGCTCTCAATCTCCATCAGAAATCATTAGCACAATTCCAACTAAGAGAACCGCTGAAGATCTTTTTGGCGACATCAATGATATAGACTTTGATGACTTAGGCTTACCGAGTAAAAGACAAAAGACTGAAGAAGAAAATGATTTAGAACTGATCAACAAGATTATTGAGGGTCGAAAAATGAGACAAATGCTATCAGAACCGAACAGAGCATTATTAGAAAATATGCCTTATTATAGTGCCAGTAAAAACTTGTCACTGAAAATACCAAAGTGGGCCTTCATGCCGTTTACAAATTGGGATGGAGATAGGGTTTATGTTCGCATGGAGTCTGAGGATAGCTGGGAAGACTCCCTAGATCTTGCACCAGATCAGATCTCTTTGAAGTCTATGTTCAGACCAGTTTGGGAGGAGGCTCGGAAAATATTGGAAGCCAAGCGAAATAGAATTGAAAATGATTCAGTTCAAGAAACCACAGATGATAAACAGGACTTCACCGATCTTTGGGTTGAGAAGTACAAACCTAAGTCCTACATTGATTTGCTATCGGAAGAACCTGTGAACAGGCCTATATCCCTGATAGTGACTGTGGGTGCCGTGTGTTCAACACGTCTAAGCTCGCGTCTCAATCTGGTGGCCAAGTGTGAGCGTGTTCATGTTGCTGCGAGGGCTTTGACAGCGTTAGCGGCACGAGCCAGAGGTGATGTACGGTCAGCGCTGCACGCCCTAAGCTTCTTAAGGGCTGGAGGCAGGAACGGGGACCAGATTAAGGTAGAAGATGTAGAGAATATAGCAATTGGTACCAAAGACAACTCTCAGAGTGTGATGCAAGCTCTACAAACAGTATTCACTCTCAACAATAATGATCATGAGGCTGTGTTGAAGGTTATCCAGGCCGCTGGGGAATATGAGAGAATAGCCGACGGCATTTTTGAGAACTACCCGTGCTCCCGTTCAGATTCAAGACTTCTGGTTTCGTGCGCCATCAGCGAACTGCTGGCTTTGTTCGACATAACCAGCAGCTGGATCCTCCGTACTTCCAACTACAGTATGTACGGAGTACTGCCATCTCTGCTGGCCAGGTGTCACGGTATTCTGGCCACCCGTCAACCACCAAGAGTGAAATTTCCACTTACTTCACAGGAGATAGTTTTCAACAAAGATCTCAAAAGCGTCGAGCCCCAGCAACGTAATTCATTCAGCAAACGAACTGGGCAATTCCAATTGAGCAACAAGTGGCGTAACAAGAAGGATGAACCGGAAATGGATGAAGATGGGAGACCGCATCATAAAGTGGCACTCGTGTGTGGACCCCCAGGTGTTGGCAAAACCACACTGGCGCATCTATTGGCTAGACTGGCTGATGAAATAGACGGAGCTCCTATACAGACAGTAGAGTTACTTGTGAGGTGGTGTACGGCTAAGAGGGATGAGGGGAAGAAGAAAAGCAGGGTCCATGTGTTGAAGAGGCCTGTTATAGCTATCTGTAATGATTTATATGCTACGTCTCTAAGGCCTTTGAGGCCTATATCCCTGATAGTGACTGTGGGTGCCGTGTGTTCAACACGTCTAAGCTCGCGTCTCAATCTGGTGGCCAAGTGTGAGCGTGTTCATGTTGCTGCGAGGGCTTTGACAGCGTTAGCGGCACGAGCCAGAGGTGATGTACGGTCAGCGCTGCACGCCCTAAGCTTCTTAAGGGCTGGAGGCAGGAACGGGGACCAGATTAAGGTAGAAGATGTAGAGAATATAGCAATTGGTACCAAAGACAACTCTCAGAGTGTGATGCAAGCTCTACAAACAGTATTCACTCTCAACAATAATGATCATGAGGCTGTGTTGAAGGTTATCCAGGCCGCTGGGGAATATGAGAGAATAGCCGACGGCATTTTTGAGAACTACCCGTGCTCCCGTTCAGATTCAAGACTTCTGGTTTCGTGCGCCATCAGCGAACTGCTGGCTTTGTTCGACATAACCAGCAGCTGGATCCTCCGTACTTCCAACTACAGTATGTACGGAGTACTGCCATCTCTGCTGGCCAGGTGTCACGGTATTCTGGCCACCCGTCAACCACCAAGAGTGAAATTTCCACTTACTTCACAGGAGATGTCCCGTAAGAAGACGGAGTTGGACAGCATCCTGTGGTCTATATGGCGTGGGTCAAGCGTTTACAACACGAAGAAATCTCTCAAGCTAGATATCGTTCCACTTCTCCCATATATCTTATCTCCCATGCTCAGATCAGCTAATATACAGTTATGCTCTGATAGCGAGCGGAAGTCGGTGTTGTCGTGTGCTGGAGCCATGTGTGATTACGGACTAACGTACATACAGCGACGTGAACAGGCCGGCTACGAACACGTTATAGAGCCCGACCTTTATAGGCTGGCTTTGTTCGGTGAGACTTCGAGTACTGTAGTTTCGATAGTCCCAGGGCGAAGAATAGCCGACGGCATTTTTGAGAACTACCCGTGCTCCCGTTCAGATTCAAGACTTCTGGTTTCGTGCGCCATCAGCGAACTGCTGGCTTTGTTCGACATAACCAGCAGCTGGATCCTCCGTACTTCCAACTACAGTATGTACGGAGTACTGCCATCTCTGCTGGCCAGGTGTCACGGTCCAGAAGACAGAATGCGTCCTTCACCAGCCGTTCGCCAAGCTATAGCAGCACAACAACAGTTAGAAGTCATCCGACGGAATGAAGATATGATGAACAGAACTTCCGGGAGAACAGAAGAGAAGTGTGAAAGGAGAGCTCCAAAGAAAGAAGTTTCAGTGGTTGGAGTGAAAGAGTCCAGATTACCAAATCACTTGCAGCGGTTGCAGCCCAAAGCTATTGAACGTGCCTTGCCTCAGATGCGTTCCCCGACCGCGGCAAATTGTGGATTAGAGCGCGTTCCCATTCTCTCGGACGGCTTTTATGACGGGTGA

Protein sequence:

>DPOGS204666-PA
MGDFPDPDEEYELMYSDDLELIREIDDGTDSKIVKNKVLPAKRSLDFSSPVQKPQLNNKQNSVNESILSESSQSPSEIISTIPTKRTAEDLFGDINDIDFDDLGLPSKRQKTEEENDLELINKIIEGRKMRQMLSEPNRALLENMPYYSASKNLSLKIPKWAFMPFTNWDGDRVYVRMESEDSWEDSLDLAPDQISLKSMFRPVWEEARKILEAKRNRIENDSVQETTDDKQDFTDLWVEKYKPKSYIDLLSEEPVNRPISLIVTVGAVCSTRLSSRLNLVAKCERVHVAARALTALAARARGDVRSALHALSFLRAGGRNGDQIKVEDVENIAIGTKDNSQSVMQALQTVFTLNNNDHEAVLKVIQAAGEYERIADGIFENYPCSRSDSRLLVSCAISELLALFDITSSWILRTSNYSMYGVLPSLLARCHGILATRQPPRVKFPLTSQEIVFNKDLKSVEPQQRNSFSKRTGQFQLSNKWRNKKDEPEMDEDGRPHHKVALVCGPPGVGKTTLAHLLARLADEIDGAPIQTVELLVRWCTAKRDEGKKKSRVHVLKRPVIAICNDLYATSLRPLRPISLIVTVGAVCSTRLSSRLNLVAKCERVHVAARALTALAARARGDVRSALHALSFLRAGGRNGDQIKVEDVENIAIGTKDNSQSVMQALQTVFTLNNNDHEAVLKVIQAAGEYERIADGIFENYPCSRSDSRLLVSCAISELLALFDITSSWILRTSNYSMYGVLPSLLARCHGILATRQPPRVKFPLTSQEMSRKKTELDSILWSIWRGSSVYNTKKSLKLDIVPLLPYILSPMLRSANIQLCSDSERKSVLSCAGAMCDYGLTYIQRREQAGYEHVIEPDLYRLALFGETSSTVVSIVPGRRIADGIFENYPCSRSDSRLLVSCAISELLALFDITSSWILRTSNYSMYGVLPSLLARCHGPEDRMRPSPAVRQAIAAQQQLEVIRRNEDMMNRTSGRTEEKCERRAPKKEVSVVGVKESRLPNHLQRLQPKAIERALPQMRSPTAANCGLERVPILSDGFYDG-