Monarch geneset OGS2.0

DPOGS216175
TranscriptDPOGS216175-TA1656 bp
ProteinDPOGS216175-PA551 aa
Genomic positionDPSCF300155 + 324294-337278
RNAseq coverage187x (Rank: top 49%)
Annotation
HeliconiusHMEL0165550.072.29% 
BombyxBGIBMGA014166-TA3e-10875.00% 
DrosophilaCG5149-PA2e-8136.18% 
EBI UniRef50UniRef50_Q7Q3895e-12748.25%AGAP007765-PA n=7 Tax=Endopterygota RepID=Q7Q389_ANOGA
NCBI RefSeqXP_317751.39e-12848.25%AGAP007765-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582975292e-12648.25%AGAP007765-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582975294e-12248.25%AGAP007765-PA [Anopheles gambiae str. PEST]
Group
KEGG pathwaybfo:BRAFLDRAFT_1248865e-09 
 K12587 (EXOSC6, MTR3)maps-> RNA degradation
InterPro domain[237-406] IPR0065711.3e-45TLDc
Orthology groupMCL14865 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216175-TA
ATGGGTAACACTAGCAAAAAGCTGGCAGCAAAATGTGCTCTGCTAACTAAGGAGGAGCAGAAGTATGTTGCGGCAACATTCAGGGCAGCCAGCAAGAACTCGGAGAGAATAAGGGAAGAGGACCTCATCAAGTTTTGGGGTCCGCAAATTGATCCGAGGTTGGCTCAATATCTCACCAATTTTCTCTTCGGCTGCGGTCAACAGAAAACAGCCACAGTGGATTTTAACAGGTTCGCTGAGCTCTACGTCTACAATGTTAGAGGCACTGTCGAGGAGAGAATGATGGTGACATATAACTGTCTAGGTATGGATTACAATGAAGACGCCGAGTTGCCCTATCAACTTTTAAAAGAGTATTGCGAGAGCATAGTGTCGACGTACATGAAGATAGTTAAGTCTTCGTCGACGAAACGCGCGTCCACGTGGTTGGAGAAAGGTTTCAGGGCGAGCGCCTCGCACGTCCAAAGTCTAGGTGAGGCGGTCGCGGCTACCATCGGGGACTTGGAGACGGCGCAGCATCATTGTACAGCAACCCAACTGTCTAAATGGTTGCAATCCAACATCCTTCTGAAGCAGCTGGCGGAGCTAGTGTACGTGAACCTGTATGGTATTAACAGACGTGGTGGTGACGAGAGCCCCACTCCCATGCCACCAGCCGCGCCATCTTTGCTGCCGGCAGTTGAAGGCTTGGAGGCAATGCCGGACTACCCCGCATTCATAGATCTCTCGCACGTCGTGTGGATCAACAGTCATCTGCCGCCGCAGCATCAGCATAAATGGAGATTCCTCTTCTCGACCAACATACATGGGGAATCCTTCTCCACTATGACCGGTCGTATCATCGACCAGGGTCCATCAGTGATCATAGTCGAGGACTCCAGCGGGTATATATTCGGGGGCTTCGCCACAGCCTCGTGGGCCTTCGGTCCAAACTTCACCGGCACCGACGACTCCTTCCTCTTCACGTGCGTGCCTAAGATGAGAGTGTACCCGGCGACCAATTACAACGATCACTACCAGTACCTGAACCATCACACAAAGACCTTGCCCAACGGACTTCTAATGGGTGGTCAGTTTAATTTCGGTGGTATCTGGATATCAGCGGAACCGTTCGGTGATGGTGCGTCCGCTGAGTCCTGCAGCACCTTCCGCGGGTACAGGCGTCTCAGCAAGGAACCGACGTTCAGACTTCGATCACTTGAAGTTTGGGCCGTTGGTGACAAACCTTTGCTCGATAAGGACGGGGACATGAAGACGTCTCAGTCCTCCAGCGTCCTAACTACACATAAATCAGAACGCAATCTGCTGGAGATGATCGGAAAACCTCAAGTCAGCGACGGACTCAGAGATAATTTCGAGGGTCTAGTACTCGAGAAATTAGGCTTCACTTGCAGTAAGAGAAGACCATCCCAGCATGTGAGCAGTCAACAGTGCAATGAAATATGTATAGAAAAGGTCTTTGAAATTTTCGCTGGTTTTAAAAAAAATCTGAGGTCCCCTAATTCCGATCAGGAAATACAGGGAACTAGAGTGGAGACTGGAGAGGATGAGCGTGACGTAAGCATCCTGGATACGAACCCTGAAGCGAAGGCGATACTTGATATGGCGGGGCGGACGCGTCACAGCGAAGGTCTGAGAGAACAGCCACCGTTATAA

Protein sequence:

>DPOGS216175-PA
MGNTSKKLAAKCALLTKEEQKYVAATFRAASKNSERIREEDLIKFWGPQIDPRLAQYLTNFLFGCGQQKTATVDFNRFAELYVYNVRGTVEERMMVTYNCLGMDYNEDAELPYQLLKEYCESIVSTYMKIVKSSSTKRASTWLEKGFRASASHVQSLGEAVAATIGDLETAQHHCTATQLSKWLQSNILLKQLAELVYVNLYGINRRGGDESPTPMPPAAPSLLPAVEGLEAMPDYPAFIDLSHVVWINSHLPPQHQHKWRFLFSTNIHGESFSTMTGRIIDQGPSVIIVEDSSGYIFGGFATASWAFGPNFTGTDDSFLFTCVPKMRVYPATNYNDHYQYLNHHTKTLPNGLLMGGQFNFGGIWISAEPFGDGASAESCSTFRGYRRLSKEPTFRLRSLEVWAVGDKPLLDKDGDMKTSQSSSVLTTHKSERNLLEMIGKPQVSDGLRDNFEGLVLEKLGFTCSKRRPSQHVSSQQCNEICIEKVFEIFAGFKKNLRSPNSDQEIQGTRVETGEDERDVSILDTNPEAKAILDMAGRTRHSEGLREQPPL-