Monarch geneset OGS2.0

DPOGS214747
TranscriptDPOGS214747-TA1515 bp
ProteinDPOGS214747-PA504 aa
Genomic positionDPSCF300022 + 690673-692187
RNAseq coverage327x (Rank: top 35%)
Annotation
Heliconius% 
BombyxBGIBMGA004741-TA2e-18062.83% 
DrosophilaCG6049-PB7e-10952.70% 
EBI UniRef50UniRef50_Q2LYN42e-10844.51%GA19321 n=2 Tax=Sophophora RepID=Q2LYN4_DROPS
NCBI RefSeqXP_001811064.14e-12857.87%PREDICTED: similar to DEAD box ATP-dependent RNA helicase [Tribolium castaneum]
NCBI nr blastpgi|2700162854e-12757.87%hypothetical protein TcasGA2_TC002366 [Tribolium castaneum]
NCBI nr blastxgi|2700162851e-14156.72%hypothetical protein TcasGA2_TC002366 [Tribolium castaneum]
Group
Gene OntologyGO:00001662.5e-14nucleotide binding
GO:00036763.4e-08nucleic acid binding
KEGG pathway 
InterPro domain[218-312] IPR0126772.5e-14Nucleotide-binding, alpha-beta plait
[228-309] IPR0005043.4e-08RNA recognition motif domain
Orthology groupMCL14855 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214747-TA
ATGGCTAAACCTGAAAAAACTAAGTCTGGATTTGTTATTAAACTTTCAAATGAGACTGTGCAAAAGCTAAACGAAGAAGAGAAAAATCTGCAAAGCGAAGGTGAGGATTCAAAGAATAAACAAGATTCTACTGGATTACTAAATGATATAATAGAGAAAGCATCAGTATCAGACACTAACGACATTACAAAAGTAGAAACACAGCAAGAAGAGTGTGCACCTTTAACGGAGAATAGTGAAAAAGGAAAAGGTGACAGTGAAAATAAGGTTGAAACCAACGAACTATGGGGTGATTACTCTCCATACATATCATATGAAGGAGTTGAAGCTATTTATACAGATCCTAATAACAAACAAAAATACACATGGAATAAGGAATCAAACTCATGGGTCCCCAAAGGAGAACCCGAGGGTCGTACATATAGTTATGAAAATGACACTCACATTTACACAGAATCAGATGGCTCTAAATTCTTTTGGGATGATGAGAAAAAAGCTTGGATTCCTAAAGTCGATGATGACTTTCTTGCTTTATACCAAATGTCCTACGGTTTTGTTGATAACACATCAGTAAAGAATGAAGATCAGGTTAAATTGAAGGAAGAGAGGAAAAAACTAGACGCTGGTGTGAAAAGAAAAAGTGAACCCACTTGGTTTGAACAGTCGGATGAAAAAAATACAAAGGTTTATGTATCAAACTTGCCCACTGATCTAACGGAAGAGGACTTCGTCAATCTGATGCAGAAATGTGGTCTGGTTGAAAGAGATCCTGTGAACCAGAAGATGAAGGTCAAATTGTACATGGACAAAGAACAGAACTGTTTCAAAGGTGATGCTCTCTGCACATACATAAAGATAGAGTCCGTCGACCTAGCGTTGAAGTTACTTGACGGAAGTGATTATAAAGGAAACAAGATAAAAGTTGAAAGAGCACAATTCCAAATGAAAGGTGACTACAATCCAGCTCTGAAACCTAAAAAGAAAAAGAAGAAGGAATTAGAGAAGCTCAAGAAAATGCAGCAGAAGCTATTTGACTGGAGGCCTGAAAAATTTATAGGAGAGAGATCAAAACACGAACGGATAGTTATAGTTAAAAATTTATTCCATCCGTCAGATTTTGATAATGATGTTCAGCTTATACTTGATTACCAGCAAGACTTGAGAGAGGAGTGCAGCAAGTGTGGAGAGGTGAGGAAGGTGGTTATATATGACGCACATCCCGAAGGCGTCGCACAGATCACTATGAAAGAACCCGAACAAGCCGACGCTGTCATCCAGCTAATAAATGGCAGGTGGTTTGGAAAACGACAAATCACAGCAGAAACATATGACGGTCGAACAAAATATAGGATAGCGGAGACCGACGCTGATATTAATAAGAGAATAAACAAGTGGGACAAATTCCTGGAAGAAGAAGAAGCTAAGAAAGACAAGAATACAACAAATGAAAGCAAATCTAACACGTCAGAGAGTGAGAAGGTATCAAGTGAAAAAGAAGAATCAGTTGAAAAATGA

Protein sequence:

>DPOGS214747-PA
MAKPEKTKSGFVIKLSNETVQKLNEEEKNLQSEGEDSKNKQDSTGLLNDIIEKASVSDTNDITKVETQQEECAPLTENSEKGKGDSENKVETNELWGDYSPYISYEGVEAIYTDPNNKQKYTWNKESNSWVPKGEPEGRTYSYENDTHIYTESDGSKFFWDDEKKAWIPKVDDDFLALYQMSYGFVDNTSVKNEDQVKLKEERKKLDAGVKRKSEPTWFEQSDEKNTKVYVSNLPTDLTEEDFVNLMQKCGLVERDPVNQKMKVKLYMDKEQNCFKGDALCTYIKIESVDLALKLLDGSDYKGNKIKVERAQFQMKGDYNPALKPKKKKKKELEKLKKMQQKLFDWRPEKFIGERSKHERIVIVKNLFHPSDFDNDVQLILDYQQDLREECSKCGEVRKVVIYDAHPEGVAQITMKEPEQADAVIQLINGRWFGKRQITAETYDGRTKYRIAETDADINKRINKWDKFLEEEEAKKDKNTTNESKSNTSESEKVSSEKEESVEK-