Monarch geneset OGS2.0

DPOGS210590
TranscriptDPOGS210590-TA2070 bp
ProteinDPOGS210590-PA689 aa
Genomic positionDPSCF300168 - 372657-378084
RNAseq coverage375x (Rank: top 32%)
Annotation
HeliconiusHMEL0030832e-13750.33% 
BombyxBGIBMGA013538-TA4e-17756.54% 
DrosophilaCG7154-PA1e-5061.59% 
EBI UniRef50UniRef50_Q5TNH76e-8337.89%AGAP009307-PA n=5 Tax=Coelomata RepID=Q5TNH7_ANOGA
NCBI RefSeqXP_553263.21e-8337.89%AGAP009307-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1583001132e-8237.89%AGAP009307-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3072085014e-9137.91%Bromodomain-containing protein 7 [Harpegnathos saltator]
Group
Gene OntologyGO:00055154e-35protein binding
KEGG pathway 
InterPro domain[69-191] IPR0014874e-35Bromodomain
[322-432] IPR0219009.9e-26Protein of unknown function DUF3512
Orthology groupMCL14199 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210590-TA
ATGGACGAGTTAGAAAAACTAAGTTGTAAGGAAGAAATATGCGACGTGAGTGATATGGAACATTCAGCTACCGACGATCATAAGGTAACAGAAGGGAAAATAAAGAAGAGGCGGAGGAGGCGGAGGGAGATTGATTTAATTTCTGAAGACGATGAAACAGAAGATGATGATATACCCAAGAAAATGTACCGACGGATCAGCCCCGGCCCTGGACCGATATCACCCGCATCTCACAGAGAACCACGCACTTGTGTTTTAAAAGCTAGCCAGAAGCGGAAACCATTGTCTCGGCTACTAGAACAGCTGCTGAGGAATCTTGAAAAGCGTGATCCAAACCAATTTTTTGCTTGGCCGGTGAATGACAACTTTGCACCCGGCTACTCCACCATCATCAAGAAACCCATGGACTTCTCAACCATGAAGCAGAAGATTGATGATAACGAATATAAATCATTGAATTGTTTTATAAGTGACTTCAAGCTGATGTGTAACAATGCAATGAAATATAATAAACCTGGAACGGTATATCACAAGGCGGCTAAGAGATTGCTGCATGCTGGCCTCAAACAGCTCACTCCACAGAAACTGAGACCGCTCGGAGACATCCTCACCTACATGTACGAGATCCCTCTCAGAGAACTCGGCTTTGACATTGGCAAAATGGATGTGGTGAGTCCCACGCAACAAGATTCAACGCCGTTAAGGACGTACGGTCGGAAGGTTGGCATTATAAGTATACTATGTAGTGACATGCACGAGGGTTGGTACATGTGTCACAACAGAACAATACTGATAGCGGAGGCGGAACGAGACTCCCAGTCCAGGTCCTCGTGGACAAGTCACAATGAGTCCGGCGCTCCAGATCATAAGGTGCTGAAGAGGAGCAGTCCCGTGAAGAGCGGCGGGGAGGAGGAGGCGGCCTCGGACGGCGGGGAGGTCGGGGACGGTGGGGGGGACGGGGTTAAGATACAGATGGAAGCCGCCAGGGAACAGCATCGTCGGAGGCTGGCCAAGAAAGCCTTCCCCCGTATGGACGCAGAAGGAAAGACCACGCTGAAGCTGGTCACTCACACCGTGGACGGCAGCGGAGAAGACAAGCCCTTGACCCTGGGACAGTACATCGGCAAACTGACCCAGGGCACGGGCTCCTTACACATCCCTCGCGAGGATCGGCGCAACATCGCCAAGTGCGTGAGGCCGCTCCACTACGGGCCATTTAGCTCGTACGCTCCCTCATACGACGGAACGTTCGCCACACTCTCCAAGGAGGAGTCACACCTCGTGCTACACACGCTGCGTGAGTACACACACGCACCGGACAAGAGGAACCGGCCCGAGACGCCTCCGCCCAGGGACGAGAGCCAGGAGCTGTCAAAAGTGAAGATCGACATCGACGAGCTGCGGTCACTGTCCTCGCTCGGCATCGACGTGGACTTCCTCAACGAGCTGGAGATGGCCACCAAGGACTACGGCCTAGGACCCGCGCTTAAACACACATACGGCCTCCTCGCCGCCCTCGAGAAGGAGCAGCGGGAACGTCTGTCCGCGTCGTCTCCGTGGCACCTGTCGCTGGTGGCGGGCGCGGGCCAGGCGGAGCGCGAGGCGGCGCGGGCGGCGGCGGGCTGGCTGCGGGCCATGGTGGCCAGGGTCCCGCCCAAGGAGGTCGCCACACACGCCGCCCTCAGGAACGCCATGGGCGTCACCCTGGAGCATCTCGAGGGTGATCCTCCAAGGGCCGAGAGCCAGGAGCTGTCAAAAGTGAAGATCGACATCGACGAGCTGCGGTCACTGTCCTCGCTCGGCATCGACGTGGACTTCCTCAACGAGCTGGAGATGGCCACCAAGGACTACGGCCTAGGACCCGCGCTTAAACACACATACGGCCTCCTCGCCGCCCTCGAGAAGGAGCAGCGGGAACGTCTGTCCGCGTCGTCTCCGTGGCACCTGTCGCTGGTGGCGGGCGCGGGCCAGGCGGAGCGCGAGGCGGCGCGGGCGGCGGCGGGCTGGCTGCGGGCCATGGTGGCCAGGGTCCCGCCCAAGGAGGAACAAATACAAAGATTATTACTATGA

Protein sequence:

>DPOGS210590-PA
MDELEKLSCKEEICDVSDMEHSATDDHKVTEGKIKKRRRRRREIDLISEDDETEDDDIPKKMYRRISPGPGPISPASHREPRTCVLKASQKRKPLSRLLEQLLRNLEKRDPNQFFAWPVNDNFAPGYSTIIKKPMDFSTMKQKIDDNEYKSLNCFISDFKLMCNNAMKYNKPGTVYHKAAKRLLHAGLKQLTPQKLRPLGDILTYMYEIPLRELGFDIGKMDVVSPTQQDSTPLRTYGRKVGIISILCSDMHEGWYMCHNRTILIAEAERDSQSRSSWTSHNESGAPDHKVLKRSSPVKSGGEEEAASDGGEVGDGGGDGVKIQMEAAREQHRRRLAKKAFPRMDAEGKTTLKLVTHTVDGSGEDKPLTLGQYIGKLTQGTGSLHIPREDRRNIAKCVRPLHYGPFSSYAPSYDGTFATLSKEESHLVLHTLREYTHAPDKRNRPETPPPRDESQELSKVKIDIDELRSLSSLGIDVDFLNELEMATKDYGLGPALKHTYGLLAALEKEQRERLSASSPWHLSLVAGAGQAEREAARAAAGWLRAMVARVPPKEVATHAALRNAMGVTLEHLEGDPPRAESQELSKVKIDIDELRSLSSLGIDVDFLNELEMATKDYGLGPALKHTYGLLAALEKEQRERLSASSPWHLSLVAGAGQAEREAARAAAGWLRAMVARVPPKEEQIQRLLL-