Monarch geneset OGS2.0

DPOGS209868
TranscriptDPOGS209868-TA1473 bp
ProteinDPOGS209868-PA490 aa
Genomic positionDPSCF300302 - 109589-123034
RNAseq coverage763x (Rank: top 17%)
Annotation
HeliconiusHMEL0075333e-10585.98% 
BombyxBGIBMGA004422-TA2e-7582.94% 
Drosophilabtz-PB2e-3245.37% 
EBI UniRef50UniRef50_Q17GZ32e-3348.65%Putative uncharacterized protein (Fragment) n=1 Tax=Aedes aegypti RepID=Q17GZ3_AEDAE
NCBI RefSeqXP_001656011.14e-3448.65%hypothetical protein AaeL_AAEL002823 [Aedes aegypti]
NCBI nr blastpgi|1571323488e-3348.65%hypothetical protein AaeL_AAEL002823 [Aedes aegypti]
NCBI nr blastxgi|1571323481e-4730.15%hypothetical protein AaeL_AAEL002823 [Aedes aegypti]
Group
KEGG pathway 
InterPro domain[41-158] IPR0185451.9e-26CASC3/Barentsz eIF4AIII binding
Orthology groupMCL18827 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209868-TA
ATGACGTCTGTGGCTAGACGACGAGAACAGGACGATTCGGGCGAATATTCGGACGCATCACAAGATATCGAGCAAAACAATTCTGGAATTGATGGCGCTCAGCATGATACCGACTATGACTCCCAGGGATCGGAAACTGAGCATTCCGAAGGGGATCCAGAGAAACAGGAGACGGAACGCCGCGTTGATGATGATGAGGATCGCAGCAATCCACAGTACATTCCAAAGAGGGGCACCTTCTACGAACACGACGATCGTACAGCGGCCAGCGGCGAGGAAACGACGAACACCACATCAGAGACGGTCTCTGAGAAGAAGGAAGGGGTGGAACCTCCAGCGGAGCGTAGAAGACCACCGAGGAAGGCAGACGCTGATAATAAGTGGGCACACGACAAATACAACGAAAACGAACAGATTCCGAAGTCTCGCGACGAGCTGGTGGCTATATACGGCTATGATATAAGGAACGAAGACGCTCCGCCTCACGCCAGGAGGAACAGGAGATATGGCCGCGGCCCTAACAAGTACACGAGAACGTGGGAGGATGAAGAGGCGTATCGTCGTCAACTGGCTCACAAGAAACCCCCCAGCCCCGCTGATTTCCCTGAACTTGGAGCCACGAACAAACCCGGGTCAACTGCAACCAGCTCAAGGATACGCTCCACCCGCTATGATGCCAGCCCAATACATGCAGATTTTGTAGCCGGACAGCAGTCACAGCACCCAGCGACCGCGGGGCAACAGATGCATCCACAACAAATAATACACCCTCAAGTGCACAGAAATCAGGGTAATCAGCACGGGCCGTCACCGCCTCATCAACCCTCACAACACCCTGCCGGCCAACACCCTCCGGCTCAACACCCTCCAACACAGCTCCCTCCAACTCAACACCCTTCCGGACAACACCCTCCCACTCAACATCCGCCGGGACAGCAACACACGCAGCAGCGTCGCGGTATACACGAGCCTCACGTCCATCACACCATACACAACATGCCGCAGCAACACAATATTGGTCAACTGCAACCAGCTCAAGGATACGCTCCACCCGCTATGATGCCAGCCCAATACATGCAGACGGGTGGAGTGACGTACTACAGTTGTGCTGAGCAGGAACAAGCGCCTAGAGCCGTTCGTCGACCGACAGCGGCCATACCCATAGTACGACCAGATCGACCCGCCAACGACAGGCCGGCGAATTCAACAGAAAAGGACAATATCGATAGGATCGTGGAAAACATGTTCGTGAGGAAGCCATGGCCGGCTCAGGGTAATCTCCCGTTCCTTTTTGTGGAAAGAAATTCTCAAGAGCCGCAGAGCAAAGAAACGTCACAACCGAACAGTTTAACATCCAGTTCGATATCAACGGATTCGAAAACAGAGAAAAACGAGGAGAAACAGGAGAAAACAGAGACTCAGGAGAGCGTCGAACGGACGGACAGCGGGAATATAGACACATCGGACGCTTGA

Protein sequence:

>DPOGS209868-PA
MTSVARRREQDDSGEYSDASQDIEQNNSGIDGAQHDTDYDSQGSETEHSEGDPEKQETERRVDDDEDRSNPQYIPKRGTFYEHDDRTAASGEETTNTTSETVSEKKEGVEPPAERRRPPRKADADNKWAHDKYNENEQIPKSRDELVAIYGYDIRNEDAPPHARRNRRYGRGPNKYTRTWEDEEAYRRQLAHKKPPSPADFPELGATNKPGSTATSSRIRSTRYDASPIHADFVAGQQSQHPATAGQQMHPQQIIHPQVHRNQGNQHGPSPPHQPSQHPAGQHPPAQHPPTQLPPTQHPSGQHPPTQHPPGQQHTQQRRGIHEPHVHHTIHNMPQQHNIGQLQPAQGYAPPAMMPAQYMQTGGVTYYSCAEQEQAPRAVRRPTAAIPIVRPDRPANDRPANSTEKDNIDRIVENMFVRKPWPAQGNLPFLFVERNSQEPQSKETSQPNSLTSSSISTDSKTEKNEEKQEKTETQESVERTDSGNIDTSDA-