Monarch geneset OGS2.0

DPOGS204571
Transcript        DPOGS204571-TA    1242 bp
Protein           DPOGS204571-PA    413 aa
Genomic position  DPSCF300300 + 230079-232996
RNAseq coverage   318x (Rank: top 36%)
Annotation
Heliconius       HMEL008383        2e-168  71.57%
Bombyx           BGIBMGA001431-TA  5e-156  88.81%
Drosophila       CG1105-PA         2e-104  45.77%
EBI UniRef50     UniRef50_E3WVS6   2e-102  50.14%  Putative uncharacterized protein n=2 Tax=Endopterygota RepID=E3WVS6_ANODA
NCBI RefSeq      XP_321170.4       4e-108  47.39%  AGAP001894-PA [Anopheles gambiae str. PEST]
NCBI nr blastp   gi|158301492      7e-107  47.39%  AGAP001894-PA [Anopheles gambiae str. PEST]
NCBI nr blastx   gi|157118098      1e-111  48.00%  hypothetical protein AaeL_AAEL008185 [Aedes aegypti]
Group
KEGG pathway 
InterPro domain  [8-155]    IPR011021  2.5e-43  Arrestin-like, N-terminal
                 [5-160]    IPR014756  2.6e-36  Immunoglobulin E-set
                 [182-291]  IPR011022  1.4e-16  Arrestin-like, C-terminal
Orthology group  MCL12595   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204571-TA
ATGGGTATCAAAGAAGCTGTTATTTACTTGGACAACCAATGGAATACTTATTATGCTGGTCAAACAGTTAATGGAAGAATAGAATACGTTTTTGATAGTCCAAAAAAAGTTAGAGGTATACATGTTAAAATTAAGGGCGAAGCTCACACTGAGTGGAGTGAAAGTAAGGAAGAACAAGATGCAGAAGGTAAAACACAATCTACAGATACTCTTCACACCGGAAATGAAGAGTACTTCCAAATATCATACTATTTACTTGGGAGTAATACAGGTAACGAAATAGAAATACCAGCTGGCAAGCAAGTATACAATTTCACTTGTACTTTACCACCCGTACTGCCATCGTCCTTTGAAGGCCAACATGGTTTTGTCAGATACACAATAAAAGTTACTTTAGATAGACCATGGAAGTTTGATCAAGAAACAAAAATGGCATTTACAGTTATTAATGCATTGGATTTGAATCTCAATCCTTCTTACCGGGAACCTATACACTTCCAAATGGAGAAGACCTTCTGTTGTTTCTGTTGTGCGTCCCCACCTTTATGTGTTGATGTGAGAGCTCCTGTTTCAGGGTATTGCCCTGGTCAAGTTATACCTTTGACTATAGATATTGAAAACAAGAGTAATGTGCAACTACATTTGGTGAAAATATTCTTGAGAAAGGTGGTGAATTATAGGGCAACCTCGCCAAGTACTTCGACACGAAAAACTAAAGATGTCATTCTAACAGTTCAAGAGGGTCCTGCTCCAGCTGGAACAACAAAGAATTGGAATTTGACAATGGAAATACCTCCAATACCACCCTCCAACTTAGTAAACTGTAACATCATAGATTTAGATTATGATTTAAAGCTAAAACTGAAAACATTNGTCACCATTGGAACAGTGCCATTAGTTAACGCTGGTCAACCCGTGCCATCTCCATCTGCGCCAATGCAACCAAGCAATGATTCCAGCCAGCCGGCTGTTCTGCCGGTAGGGCCAGGCGAACCAGCCGTGCCTTCTAATGTGCCAAGTGGTGACGGCGCTCCAGGGGGCTGGGTGGTTCCAGGTGAACCNNNNNNNNNNNTGTATGTTAAATATTTCTTCACAGCACAACCGGTGTATAAAGAATCTCAGTATGCTGCTCGGACGATCCAAGATAGAGGTGAAAGCAAAAACATGGCTATAACCGGAGCTGCCAATTTCGCGCCATATTATCCGACGTACGCTTGGCAACCGCCGCCATTACCGCAGTGA

Protein sequence:

>DPOGS204571-PA
MGIKEAVIYLDNQWNTYYAGQTVNGRIEYVFDSPKKVRGIHVKIKGEAHTEWSESKEEQDAEGKTQSTDTLHTGNEEYFQISYYLLGSNTGNEIEIPAGKQVYNFTCTLPPVLPSSFEGQHGFVRYTIKVTLDRPWKFDQETKMAFTVINALDLNLNPSYREPIHFQMEKTFCCFCCASPPLCVDVRAPVSGYCPGQVIPLTIDIENKSNVQLHLVKIFLRKVVNYRATSPSTSTRKTKDVILTVQEGPAPAGTTKNWNLTMEIPPIPPSNLVNCNIIDLDYDLKLKLKTXVTIGTVPLVNAGQPVPSPSAPMQPSNDSSQPAVLPVGPGEPAVPSNVPSGDGAPGGWVVPGEPXXXXYVKYFFTAQPVYKESQYAARTIQDRGESKNMAITGAANFAPYYPTYAWQPPPLPQ-
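
The protein record above can be cross-checked against the nucleotide record by translating the coding sequence. The following is a minimal sketch, not the pipeline that produced this entry. It assumes the standard genetic code, handles only the letters A, C, G, T and N, and prints the stop codon as "-" to match the record. The function names translate and translate_codon are hypothetical. A codon containing N is resolved only when every possible expansion gives the same residue, and is otherwise reported as X; this rule is an assumption, although it agrees with the record, where CCN appears as P and TTN as X.

```python
from itertools import product

# Standard genetic code, laid out in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_codon(codon):
    """Translate one codon; 'N' is treated as any of the four bases.

    Assumption: an ambiguous codon is resolved only if all of its
    expansions encode the same residue, otherwise 'X' is returned.
    """
    options = [BASES if base == "N" else base for base in codon]
    residues = {CODON_TABLE[a + b + c] for a, b, c in product(*options)}
    return residues.pop() if len(residues) == 1 else "X"


def translate(cds, stop="-"):
    """Translate a coding sequence in frame 0, ignoring trailing partial codons."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    protein = "".join(
        translate_codon(cds[i:i + 3]) for i in range(0, usable, 3)
    )
    # The standard table marks stops with '*'; the record uses '-'.
    return protein.replace("*", stop)


# First five codons of DPOGS204571-TA.
print(translate("ATGGGTATCAAAGAA"))  # prints MGIKE
```

The lengths listed in the record are consistent with this reading: 413 residues plus one stop codon correspond to 414 codons, or 1242 bp.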