Monarch geneset OGS2.0

DPOGS205016
TranscriptDPOGS205016-TA1746 bp
ProteinDPOGS205016-PA581 aa
Genomic positionDPSCF300442 - 50551-59243
RNAseq coverage341x (Rank: top 34%)
Annotation
HeliconiusHMEL0142741e-13269.27% 
BombyxBGIBMGA001666-TA1e-12162.16% 
DrosophilaBrd8-PA3e-4242.13% 
EBI UniRef50UniRef50_Q17F512e-4946.77%Polypeptide of 976 aa, putative n=1 Tax=Aedes aegypti RepID=Q17F51_AEDAE
NCBI RefSeqXP_310403.68e-5348.12%AGAP003845-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479707401e-5148.12%AGAP003845-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3123747291e-5731.97%hypothetical protein AND_15595 [Anopheles darlingi]
Group
Gene OntologyGO:00055157.7e-31protein binding
KEGG pathway 
InterPro domain[429-567] IPR0014877.7e-31Bromodomain
Orthology groupMCL17487 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205016-TA
ATGAGTTCAATACAGGAGCGTTTGCAATTAAAGCGTGTGCCGCTGGACACGTGGAATGTGAGGGAACAGCTATGTCTAGCGTCGGCAGTCGTGAGGAGCGGCGATCAGAATTGGATGACGGTTTCTCGGGCTCTTAAAACTGTCGGAGAATCAAACAGGCCGCCGGACTGGTATTCTCAAAAAAGTTGCGCCGCTCAATATGGAGCCCTATTGGAGCACGTTGAGACACCGAAACGCAAGAAAAGAAACTCGGAGGGCGGAGTGGAAACACCGCAGGAGAGCATCTTAAAACAATTAACGCAACAACGTATACAGGAAATACAGAACACATTGACGGAAATGAATTCCCAGTACGAACAGTTGAAGAACGAAATTAACGAAGTGCGCAACAACTCGACTCCAGATGAAAGGATCCAGGAGTTGTGGGCTGGTATAGAGAGCGCCAAGCGAGCGAGGGAAAGAGACAGCGCTAGGCGGGCTGCCTGGCTGAAGGAAAGAGAGGAGAGACGGGCCAGGGCAGAGAGGACGTGGAGGCCGCAGACCACGTCACCGGCTGCTAGCCCCGCGCCCCCGTCTTCGCCTCTGCTGACGTCACTCCTCAAGTCCTCACCCGTGGTCACAACGCCCCAGCACATACCACATCCGGCTAGTGTTGTGGACACGGTGTCCCCATCAGCTGGTGCGCCGACGCTCTCCCTGCTGTTGGAACTGCCCCATGATAACAAACATGCCGCCATAGAACACATCAAGAGCCAGCTGGTTCAGATAGAACACCAGCTGAAAGCTAATTCCCAGCCTGCGGGCCCGGCGGTCCCGGCGGTCCCAGCGGTCCCGGCGGTCCCGGCGGTCGATATAGATGACATTGAGATAAAAGCTGAAGACGTCTATGCCTTCCGAGACATAGACATCCACATCCCACCCGTGACCAGCATGCACAAGGCTAACGTGCGGGTGCTGGAGAAGTCTAAGAAAGGTGCGGAAGCTGAGCCCGTTGATGTTGAGAACACAGAGGCACCCGTTGCTATGGACTATAAGGAGGAAACGACAGTACAGGAGAGCCAAAAGGAAGAGCCGCGGACAGAGACGCCATTACCGGAGGAAACCCAACCGGAAGTGATAGAAGAGAAACAAGAGGAACCGGAGAAAGAGGAAGTCAAGATCAGCTTCCCGGAAGTCAAGTTCCCTACGCCGGAAATAAAAGTTATACAAGATGAACAGAAGTTGAGTAGAGAAGAGGACAGAGACGGAAAGAAGAAAAGGGACTATTCGAGGAAAAAGAAATCAGATTCTAGAACCTGTTCAGGTTCGGAGAGCGCTCCGGAGTCTCCGTCAGCGAGCGACGCCGAGCGGCAGCACAGGTTGTGGAGGAAAAGCGTCATGCTGGTGTACAGCAGGCTGTGCGCACACAAATACGCTTCCCTGTTCCTGAGACCGATAACCGACGAGGAGGCGCCAGGGTACAGTGTGGTGGTCAAGAGGCCCATGGACCTCACCACCATACGCAGGAACATCGACTCGGGAAACATACGAACAACAGCGGAGTTCCAGCGTGACGTCCTTCTGATGCTGTCCAACGCCCTCCTTTATAACAGCAGCTCGCACAGCGTGTACAGCATGGCTAAGGAGATGCATCAGGAGGCTCAGTGTCAGCTCGCGATGCTAGTGGCGGCCCAGGCCCACGCCGGCCTCAACCCCCCGCCCGCCAGGAAGAGACGCTTCCACGCGCACTCGTACAAGAGAAATTAA

Protein sequence:

>DPOGS205016-PA
MSSIQERLQLKRVPLDTWNVREQLCLASAVVRSGDQNWMTVSRALKTVGESNRPPDWYSQKSCAAQYGALLEHVETPKRKKRNSEGGVETPQESILKQLTQQRIQEIQNTLTEMNSQYEQLKNEINEVRNNSTPDERIQELWAGIESAKRARERDSARRAAWLKEREERRARAERTWRPQTTSPAASPAPPSSPLLTSLLKSSPVVTTPQHIPHPASVVDTVSPSAGAPTLSLLLELPHDNKHAAIEHIKSQLVQIEHQLKANSQPAGPAVPAVPAVPAVPAVDIDDIEIKAEDVYAFRDIDIHIPPVTSMHKANVRVLEKSKKGAEAEPVDVENTEAPVAMDYKEETTVQESQKEEPRTETPLPEETQPEVIEEKQEEPEKEEVKISFPEVKFPTPEIKVIQDEQKLSREEDRDGKKKRDYSRKKKSDSRTCSGSESAPESPSASDAERQHRLWRKSVMLVYSRLCAHKYASLFLRPITDEEAPGYSVVVKRPMDLTTIRRNIDSGNIRTTAEFQRDVLLMLSNALLYNSSSHSVYSMAKEMHQEAQCQLAMLVAAQAHAGLNPPPARKRRFHAHSYKRN-