Monarch geneset OGS2.0

DPOGS208155
Transcript        DPOGS208155-TA   1407 bp
Protein           DPOGS208155-PA   468 aa
Genomic position  DPSCF300058 + 29989-39428
RNAseq coverage   515x (Rank: top 24%)
Annotation
Heliconius      HMEL011078        1e-175  69.57%
Bombyx          BGIBMGA013766-TA  3e-75   75.56%
Drosophila      CG7131-PA         1e-43   33.96%
EBI UniRef50    UniRef50_Q7QBL0   2e-51   36.81%  AGAP003120-PA n=2 Tax=Anopheles gambiae RepID=Q7QBL0_ANOGA
NCBI RefSeq     XP_001121451.1    9e-52   35.35%  PREDICTED: similar to CG7131-PA, isoform A [Apis mellifera]
NCBI nr blastp  gi|347969292      8e-51   36.81%  AGAP003120-PA [Anopheles gambiae str. PEST]
NCBI nr blastx  gi|347969292      1e-60   36.81%  AGAP003120-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
Orthology group  MCL12877  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208155-TA
ATGCCAGAGTGTGTACCCTGCCCGGCGGCGGCACCTTGTATGCCCACTGCTGGGGTTAAATGTGGGCCAAAAGATGGGAAGCCGATGGACGGACGACAGGAACACTCCTGTCCCTGTCGGTCTTGCTGTTGTGGGAAGCCGGCCGCCGTTAAACCATGCTACAAGCAACCCAGGATACCTGATTCGTACGCTCCACGCCGCTGCTACATGAAGCCAAGCGCCCCAGTGGAATCTTGCACCACATACAAGCTCTCTTACTTGCCTGTTGATGGATGTCAGAACTTGAGAGGCGTCGCTCGCAAACCTCCACCGAACCTTGTACCTAGCTGTGAGCCCATGGAGGCTTGCACGATGTCCTTCTTGCCAAATCCGGTGTGCGTCACACAACCCATACGCCCATGTCACCACGACATGTGGGGCCAGGGACCGATGCAGAACATCACAACACAACGTCACGACTACGTTCCTAAGCCCTGCGTACTCCGCGAGAGTTGCAAGCCACCGGCGAAGTTCCATTGTGTGGAACAGCCCTTTGAAAATCGCACTGTGAACAAGCTGTCTTACTTGCCGCCTGAGAAGATTGAATTGGTCAAATCATACGCCCCGGAACGTTGTTACGAGCGACCAGCGGCTAAAATGGATGGCAGCACTACGCACAAATTGAGCTATATGCCAAATCAGATCATGCCAAAGGAGCCTCTGCCTTGGGCTTGTAAAGGACAATATCAAAAGCCCTGTCAGAAGTTGGAGGGAAACACGACTTATACAATGAGTTATTTGGATTCTCAAAGTGATTGTCGAAGGCGCGCCATAATACCAGACAGCTGCACCAATCCCGTAACGGCTTCGAAAAGATTCGAAACACAGACTATCTATAAGAACAGTTATCTTCCAACGACGGCCCCCATTCCTCATCCTATAAAGCCTCTGCCAAATCTCGTTCCATCCACAGCGCAAATGGAAGGCGATACTGTCCAGAAGTTATCCTTCTTGCCAAATCCGGTGTGCGTCACACAACCCATTCGCCCATGTCACCACGACATGTGGGGCCAGGGACCGATGCAGAACATCACAACACAACGTCACGACTTCGTGCCCAAGCCGACTGTACTGCGCGAATCTTTCAAACCGGCTCATAAATTCCATTGTGTAGAACAGCCCTTTGAAAATCGAACAGTAAATCGGATGTCGTATTTAGACCCGGGAAGGCAGCCGCCTCCGGAGTCCTATGCTCCTAGCAGATGTTACGAGAAACCTGCAGCCAAAATGGAAAGCGATACGATACAGAAGATGTCCTATCAAGCTGTGTGCGCGTCAGCACCTCAACGGCCGCCGTGGGCATGCAAAGGGCAGTACCAAAAGCCCTGTCAAAAGCCTGTAATTGTGTCTGTCCCGCTCAATGCATAG

Protein sequence:

>DPOGS208155-PA
MPECVPCPAAAPCMPTAGVKCGPKDGKPMDGRQEHSCPCRSCCCGKPAAVKPCYKQPRIPDSYAPRRCYMKPSAPVESCTTYKLSYLPVDGCQNLRGVARKPPPNLVPSCEPMEACTMSFLPNPVCVTQPIRPCHHDMWGQGPMQNITTQRHDYVPKPCVLRESCKPPAKFHCVEQPFENRTVNKLSYLPPEKIELVKSYAPERCYERPAAKMDGSTTHKLSYMPNQIMPKEPLPWACKGQYQKPCQKLEGNTTYTMSYLDSQSDCRRRAIIPDSCTNPVTASKRFETQTIYKNSYLPTTAPIPHPIKPLPNLVPSTAQMEGDTVQKLSFLPNPVCVTQPIRPCHHDMWGQGPMQNITTQRHDFVPKPTVLRESFKPAHKFHCVEQPFENRTVNRMSYLDPGRQPPPESYAPSRCYEKPAAKMESDTIQKMSYQAVCASAPQRPPWACKGQYQKPCQKPVIVSVPLNA-