Monarch geneset OGS2.0

DPOGS207331
TranscriptDPOGS207331-TA2064 bp
ProteinDPOGS207331-PA687 aa
Genomic positionDPSCF300188 - 264105-269626
RNAseq coverage541x (Rank: top 23%)
Annotation
HeliconiusHMEL0088542e-9277.63% 
BombyxBGIBMGA010107-TA2e-16476.76% 
DrosophilaAtu-PA7e-11061.64% 
EBI UniRef50UniRef50_E0VUD94e-11647.80%RNA polymerase-associated protein LEO1, putative n=1 Tax=Pediculus humanus corporis RepID=E0VUD9_PEDHC
NCBI RefSeqXP_966460.12e-11856.24%PREDICTED: similar to AGAP003242-PA [Tribolium castaneum]
NCBI nr blastpgi|2700071114e-11856.57%hypothetical protein TcasGA2_TC013564 [Tribolium castaneum]
NCBI nr blastxgi|1955020218e-16450.70%GE10143 [Drosophila yakuba]
Group
KEGG pathway 
InterPro domain[336-658] IPR0071494.8e-86Leo1-like protein
Orthology groupMCL13924 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207331-TA
ATGGCTCCAAATGGAAGAAGAGGCTCAGTGGACACTGATTCCGGTTCCGATTCCGACAGCGGTTCCAGTGCAAGTCACAGCAAAAGTGCTAGTCCGGCACCATCTGGTAAAGAAGGCAGTCAGTCTCGATCGGTATCCAGAAGTCCCGCAAAATCTGGAAGTGAATCTCCAAAGTCCAATCGCTCTGCTAGATCACGGAAATCATCGAACGCATCAAACGCGTCCCGTTCAAAATCAAACTCCCCAGCTGGGTCTAACAGGTCAGGATCCGCGCAGAGCAACAAATCCGCTTCGTCAAGTCCAAAATCAAGGGCATCCAGAAGTCCCAGGGCGTCAAAATCCAGATCCAGATCCCGTAGTGGGTCAGGCAGTGCTAGATCTAGATCCGGTAGCGCTCGTTCCAGGTCAGGTAGTCCGAAATCCGGTGCACAGAGCCCAAAGTCCAGATCGCAGAGTCCAAAGTCACGATCGCAAAGCCCTAAGTCTCAGGGTGCTAAAAGTGGGTCTCGTTCGCGTAGTGGAAGTCCTAAAAGCAGGAAGTCGAGATCTAGGTCAGGAAGTGCGTCATCAAGATCGAAGAGTCCAGAAGCCAGACCGGATGTAGCCGACAGCAGATCCAATTCACCAAATCTCATGATTGACGACCAAGCCAAGGAAAAATCAGGAAGCAGATCTAGATCGCACTCAAAATCCAGATCCAGAAGTAAATCTAAGTCAAGATCCAGAAGCAAATCGAAGTCGAAGTCAAAAAGCCGTTCGCGATCACGTTCGAAGAGTTCTAACGCTTCAGATGTCGGCGGTAAGAAGAAAAGCTCCGTGCTATCTGACTCGGAGAGTGATGCTGGGCAGAAAGGTCCCAAACGTAAGAAAGATTCAGACAGCGGCTCCGACACGAGCAACAAACCTAAGAAGAAGACAAAGAAACTTGACTCTGATGATGACAATCAGGAGGCGACCGTGACAGCTGATGCGTTATTCGGCGACGCGTCCGACATCAGTACTGACAATGAGGGTGAGAAGGAAAGGTCGAGGTCGAGGTCCAAGTCGAGGTCGAGGTCACGCAGCCGCAGCAGGTCCGGTGACGAGAGGCGCTCAGACGACGCCAAGGGGAGTGGAGACGAGGAAAATAGGGAGAAACCCGAAGAGGAAGAGGAGATTGAGATCCCAGAAACTCGTATAGATGTGGACATGCCTAAAATATGGACGGAACTTGGCAAGGAATTGCATTTCGTGAAGCTTCCCAATTTTTTGTCAGTGGAAACTAGGCCTTACGATCCAAATACATATGAAGATGAGATTGATGAAGAAGAAACGCTCGATGAAGAAGGTCGTGCGAGGTTGAAGCTCAAAGTGGAGAATACGTATGCGACCGCCTGCCACAAGCTTAAAGAAGGTAACGCTGTGAAGGAATCCAACGCGCGGATGGTGAAGTGGTCCGACGGGAGCATGTCCTTACACCTCGGCTCCGAGATCTTTGATGTTTACAAGCAACCTCTACACGGCGACCACAACCATCTGTTCGTCCGTCAAGGCACGGGTCTCCAGGGCCAGGCGGTGTTCCGCACCAAGTTGTCGTTCAGACCTCACTCCACGGACTCGTTCACACATCGCAAGATGACGCTGTCTGTGGCGGACAGGTCCACGAAGACGTCCGCTATAAAAATACTGTCGCAAGTAGGCAGCGACCCTGACGCGGACAGGAAATATCAGCTGAAGAAAGAGGAGATGGAGCTGCGTGCTGCGATGAGGTCCCGGGTGTCCAGTCGACCCAAGAGGAGGGCGGGCGGGGGCGGGGGGGCCCGCGCTCACAGGCACGACGACTCAGAGGACGAGGGCGGGGTGTCGCTGGCGGCCATCAAGAACAAGTACAAGGCTGGACAGAAAGCGAGCGCCGGGGCCGCGATCTATTCGTCGGAGTCTGACGGCTCGGATGTGGAGACCCGTCGCGCTAGGAGGCTGGACAGGGCGAAGGCTTTGAAGGACTCCGACGACGAAGCGAGTCCCGGGAACAACACGCCGCAGCAGAGTCAAAGCGGCTCGGGCTCCGGCAGCGGCAGCGACTGA

Protein sequence:

>DPOGS207331-PA
MAPNGRRGSVDTDSGSDSDSGSSASHSKSASPAPSGKEGSQSRSVSRSPAKSGSESPKSNRSARSRKSSNASNASRSKSNSPAGSNRSGSAQSNKSASSSPKSRASRSPRASKSRSRSRSGSGSARSRSGSARSRSGSPKSGAQSPKSRSQSPKSRSQSPKSQGAKSGSRSRSGSPKSRKSRSRSGSASSRSKSPEARPDVADSRSNSPNLMIDDQAKEKSGSRSRSHSKSRSRSKSKSRSRSKSKSKSKSRSRSRSKSSNASDVGGKKKSSVLSDSESDAGQKGPKRKKDSDSGSDTSNKPKKKTKKLDSDDDNQEATVTADALFGDASDISTDNEGEKERSRSRSKSRSRSRSRSRSGDERRSDDAKGSGDEENREKPEEEEEIEIPETRIDVDMPKIWTELGKELHFVKLPNFLSVETRPYDPNTYEDEIDEEETLDEEGRARLKLKVENTYATACHKLKEGNAVKESNARMVKWSDGSMSLHLGSEIFDVYKQPLHGDHNHLFVRQGTGLQGQAVFRTKLSFRPHSTDSFTHRKMTLSVADRSTKTSAIKILSQVGSDPDADRKYQLKKEEMELRAAMRSRVSSRPKRRAGGGGGARAHRHDDSEDEGGVSLAAIKNKYKAGQKASAGAAIYSSESDGSDVETRRARRLDRAKALKDSDDEASPGNNTPQQSQSGSGSGSGSD-