Monarch geneset OGS2.0

DPOGS204391
Transcript: DPOGS204391-TA (3957 bp)
Protein: DPOGS204391-PA (1318 aa)
Genomic position: DPSCF300002 - 1493261-1505033
RNAseq coverage: 1173x (Rank: top 11%)
Annotation
Heliconius: HMEL007823, 0.0, 83.54%
Bombyx: BGIBMGA000815-TA, 2e-17, 39.34%
Drosophila: fs(1)h-PB, 6e-80, 58.53%
EBI UniRef50: UniRef50_E2AA50, 0.0, 53.90%, Homeotic protein female sterile n=3 Tax=Formicidae RepID=E2AA50_CAMFO
NCBI RefSeq: XP_624214.2, 0.0, 55.09%, PREDICTED: similar to bromodomain containing 3 [Apis mellifera]
NCBI nr blastp: gi|350416103, 0.0, 56.75%, PREDICTED: homeotic protein female sterile-like isoform 1 [Bombus impatiens]
NCBI nr blastx: gi|328788637, 0.0, 46.91%, PREDICTED: hypothetical protein LOC551826 [Apis mellifera]
Group
Gene Ontology: GO:0005515, 6e-46, protein binding
KEGG pathway:
InterPro domain: [33-173] IPR001487, 6e-46, Bromodomain
Orthology group: MCL10372 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204391-TA
ATGCCTCTTAGCCAGAAATTGGATGAGCCTCTTGTTACAATTGAGATCGCAGACTCCAGCACAATGCAGCAGGCAGTCGATCCACTTGCTACAACAAGCACTCCGATGTCGGAACCCGTCAATGGGGCGTCCTCGGAGGAGGCTCCTCGTCGACAGGGTCGTATGACGAACCAGCTGCAGTTCCTACAGAAGAATGTGATAAAGGCAGTGTGGAAACATAAATTTGCGTGGCCTTTCCATCAACCAGTAGATGCAAAGAAATTAAACCTACCTGATTATCACAAAATCATCAAAAAGCCTATGGATTTAGGCACAATAAAAAAGCGTTTAGAGTCAAACTACTACTACTCAGCTCAAGAATGCATACAGGATTTCAATACCATGTTCACAAATTGTTATGTTTACAACAAACCTGGAGAGGATGTAGTGGTGATGGCACAAACATTAGAAAAATTATTTTTAAATCGGATAGCGCAAATGGACAAGGAGGAGAAGGAGATCGAGATGCCTTCGAACAGTGGGAAGAGTGGTGTGAAGAAGCGAGTGGGTGGTTCAAGTGTAGGCGGACCTCCGATGGCGGGAACGGGTTCAATGCCCGCTTCACCGGCTCTCACTTCTCGTGCAGCCGTTAAACCGCTCCCGCCGGCGCCGCATCCCAACTTTGTGGGCTCTACAAACACAACAACCACCCCAACTCTAACAGCGCCTTCAGTCACCCCACCCGCCACGCACACCGGGTTGCCGCAGCAGGTTGCAACTCAACCTTCTAATTTCCACGTAACACAAGCAGCCGCTCCTCCGGTATCTACGCTTCCTGCTGTGGCATTGTCACAGACTCAGCCCGCGAAGGTTAAAAAAGGCGTTAAAAGAAAAGCTGACACTACGACTCCTATGGGTAGTTCCTTTGAAGGAGGCTATACAACTCCTACGATCGATCAGCAAGGTGGCCCTAAACCAGCTAAAATATCAACAAGAAGAGAAAGCGGCAGACAGAAAAAGCCCGGACGAGTGGGAGACGACGGGTTCAAGATGGGCGGTCTGTCGCCTGGCGTGGGCGGTGCGGGAGCGTCACACCACGCCGCGCTTACTCCACAGGCCGCCAAGAACAAAGAAAAACTCTCCGACGCGCTCAAAAGCTGCAACGAGATACTTAAAGAACTTTTCTCTAAGAAACATTCGGGTTATGCATGGCCGTTTTATAAACCTGTGGATGCGGAATTACTAGGTCTACATGATTATTTTGATATTATTAAGAAGCCTATGGACCTGGGCACAGTGAAACATAATATGGATCATAGAGCGTATAAAACGGCTGCCGAGTTTGCAGCAGATGTCCGCTTAATATTTACTAATTGTTATAAGTATAACCCTCCCGATCACGATGTTGTTGCGATGGCTCGGAAATTGCAGGACGTTTTCGAAATGAGATATGCAAAGATTCCTGATGAACCAAGTCACGTCCATGTCGGAGTTCCACATATGGACAAAGGAAGTTCCGCTTCTAGTTCCGAATCAGGCTCGGAATCTGACTCTGAGTCAGATGACTCGGAGGAGGAAAGAAACAACAAGGTCAAAATATTAGAAAAAGAGCTCCTAGCGCTGCAGGAAAAAATGAGAAAGCTCGTCGAGGAATCAAATAATAAGAAGAAGGCGAAAAAGAAAATGAAAGACAAACAGAAAAAACAAATTACCAATAATGCAATTCCCAAAACGAATGCTGTGGCAGCTTCCGGGTACAACGCTAAAACGAACAACATCGCTGAAAACCTAGCGACAAGCGTCCGAGGGAAGACTGGCAGTAAACGTGGCGCGGGCGCCAATGCAGCTGGTGCGGCAGCCGTCACCGGACAAGCCAAGGCCGCGGCTCGCGCACCGGCGAAGAAGAAGAGCTCCACCCCCACCGCCGCGCCTCCCCACCACGCACCACCACACCACCAGGACCCCGACACGGACGACGAGGATAACGCCAAGCCCATGTCCTACGACGAGAAGAGACAGCT
CTCGCTCGACATCAATAAGCTGCCCGGTGACAAACTTGGCAAAGTAGTCCATATTATCCAAAACAGAGAACCCTCGTTAAGGGACTCTAACCCGGATGAAATTGAAATTGATTTCGAGACGTTGAAACCATCCACCCTTAGACAGTTAGAAAGCTATGTCGCGTCATGTCTGCGGAAGAAAACTCATCGAAAGGTGTCTGGCAAATCTAAAGATGAACAAATGGCGGAAAAGAAACAAGAGTTAGAAAAACGGTTGCAGGATGTGTCAGGACAGTTAGGAAGCAATAAGAAACAGCAACCTAAAAAAGAAGGATGCAAGGACGGCCTCGGCGGCGGCATGTCGTCGTCGTCCAGTTCGTCGGACTCGTCCAACTCGTCGTCCAGCACCGACACCAGCTCGTCGGACAGCAGCGACAGCGAAGCAGGTGCCAGCTCCGGAAAACCTTCCAAGAAAAAGGGAAAGAAGCAGGTTCAACAGCTGCCGCAGCAAACTAAAACGGTTCCAACCGTGGTGAGTGCGGTGGCTCCGCCCGCTCCGCCGCCAGTGCCGCCCTCGTCTTCCGAGCCCGCGGCGCCGCCCGAGCCCAAGCATGAGCTGGTCCAACAGCCGTCCGCCCCCGCGACATCCTCTTCCCCAAATCCCGCACCCGCCCCGGACATCAAACTGTCGCCTCCGGCTTCGCTCGCTCTACCTGTGCCTCCAGTTAACCACCCTCCTGTTACCAGTCTACCGGACTCGTCAAAGACGCCGTTGCTTAGGGAACAAAAGCCCCAAAAACCATTGGTGAAAAGCGAACCTCGAAACGGTCTAGATAAAGTAGACACGTCTCACTACATAGATCCTATAGAGCGGTCGTTGGCCAGCTTGGAAAGAAGTCTTCAAGCCGACGTACCAATGGATGTCAGTGTGAGTGTCTCCGAGTCATCGATGAGACTAGAGGACTTTGCTCTGCCGAAACCTTCCATTATGCCAGACGCTACGCATCACAATCTAATGGCCCAACTCGGAGGGCTCACGGACATGACCCATGTAACAGAGCAAATCAAGAATGAAATGTACGTACAACCTCATAACGGATACGTAGAGAAGACCTCGCAGCATGAGCGTGAGATGTTGAGGTCCGATATGAATCCTAACCTAGTGGGCATGACCACAGCACCACCTGTGTCATCCATCTTCGACCCCGTATACTCGGCTCCTCACACGCACGCCATCACAGCACAGTCTCATCTCAATATGCAGAGTGCTACAAACTTAATGGCACCTATAGTAAAAAAGGAAGACGTGAAACCGTTACTTACGCCAAAACCGATCGAGGATCTGATGGTCCCTAATATGATTACAAACAATATGAGTGATAGAGCAAAATACGAAATGGAAAAGAAAATAGAGGATAGCAAAAATTCTACTTTTGCTCAAGCTTTTAAACTAAAGCAAGAACAGAATTTAAAAAATGCTAGTTCTTGGTCGTCACTCGCTCAAGCCGGAAGTCCTCAGAGTATTCCTAGCGTCGGAAACACCAATCAAATAAAACAAAAACCAGTTATGGACAGTTTTCAAGCTTTTAAGAAACAGGCTCGAGAAAAGATAGACAGGCAAAGAGCTTTAATTGAGCAGCAGGAGTTGAGGAAGAAAGAGCAAGCGGAAAGAGAAAGACAACGTCAGGAAACAGAAAGGCGGCATCCCGAGGATGACAAAATGAGGGTCGGTGTGAGTGCACGCAAGGTGGAGAGCGCGGAAGTGTCGTCACCGTCAGTGTCGCCGGTGGCGCGCGGCTCCCCGCCCGCAGCGCCCGCTGCTCCGCCCGCGCCCGCAGCACCCGACAAGCCGCCTATCTCTGAACGTGACCGGCTCCGGCAACGAGAACAGGAACGCAGAAGGAGAGAGGCGTTGGCTGGGCAAATTGACATGAATTTCCAAAGCGATACGATGGCGGCCTTTGAAGAAACACTGTAA

Protein sequence:

>DPOGS204391-PA
MPLSQKLDEPLVTIEIADSSTMQQAVDPLATTSTPMSEPVNGASSEEAPRRQGRMTNQLQFLQKNVIKAVWKHKFAWPFHQPVDAKKLNLPDYHKIIKKPMDLGTIKKRLESNYYYSAQECIQDFNTMFTNCYVYNKPGEDVVVMAQTLEKLFLNRIAQMDKEEKEIEMPSNSGKSGVKKRVGGSSVGGPPMAGTGSMPASPALTSRAAVKPLPPAPHPNFVGSTNTTTTPTLTAPSVTPPATHTGLPQQVATQPSNFHVTQAAAPPVSTLPAVALSQTQPAKVKKGVKRKADTTTPMGSSFEGGYTTPTIDQQGGPKPAKISTRRESGRQKKPGRVGDDGFKMGGLSPGVGGAGASHHAALTPQAAKNKEKLSDALKSCNEILKELFSKKHSGYAWPFYKPVDAELLGLHDYFDIIKKPMDLGTVKHNMDHRAYKTAAEFAADVRLIFTNCYKYNPPDHDVVAMARKLQDVFEMRYAKIPDEPSHVHVGVPHMDKGSSASSSESGSESDSESDDSEEERNNKVKILEKELLALQEKMRKLVEESNNKKKAKKKMKDKQKKQITNNAIPKTNAVAASGYNAKTNNIAENLATSVRGKTGSKRGAGANAAGAAAVTGQAKAAARAPAKKKSSTPTAAPPHHAPPHHQDPDTDDEDNAKPMSYDEKRQLSLDINKLPGDKLGKVVHIIQNREPSLRDSNPDEIEIDFETLKPSTLRQLESYVASCLRKKTHRKVSGKSKDEQMAEKKQELEKRLQDVSGQLGSNKKQQPKKEGCKDGLGGGMSSSSSSSDSSNSSSSTDTSSSDSSDSEAGASSGKPSKKKGKKQVQQLPQQTKTVPTVVSAVAPPAPPPVPPSSSEPAAPPEPKHELVQQPSAPATSSSPNPAPAPDIKLSPPASLALPVPPVNHPPVTSLPDSSKTPLLREQKPQKPLVKSEPRNGLDKVDTSHYIDPIERSLASLERSLQADVPMDVSVSVSESSMRLEDFALPKPSIMPDATHHNLMAQLGGLTDMTHVTEQIKNEMYVQPHNGYVEKTSQHEREMLRSDMNPNLVGMTTAPPVSSIFDPVYSAPHTHAITAQSHLNMQSATNLMAPIVKKEDVKPLLTPKPIEDLMVPNMITNNMSDRAKYEMEKKIEDSKNSTFAQAFKLKQEQNLKNASSWSSLAQAGSPQSIPSVGNTNQIKQKPVMDSFQAFKKQAREKIDRQRALIEQQELRKKEQAERERQRQETERRHPEDDKMRVGVSARKVESAEVSSPSVSPVARGSPPAAPAAPPAPAAPDKPPISERDRLRQREQERRRREALAGQIDMNFQSDTMAAFEETL-
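
The transcript and protein lengths listed above are consistent with each other: 1318 codons plus one stop codon give 3 x 1319 = 3957 bp, and the protein record marks the stop with a trailing "-". The following Python sketch illustrates that check and translates the first and last few codons of DPOGS204391-TA. It assumes the standard genetic code (NCBI translation table 1); the function names `translate` and `expected_cds_length` are hypothetical helpers introduced for illustration and are not part of the database or of any library.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# Assumption: the record uses the standard code; "*" marks stop codons here.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0, codon by codon.

    Stop codons are rendered as `stop_symbol` ("-" matches the protein
    record above). Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def expected_cds_length(n_residues: int, include_stop: bool = True) -> int:
    """Length in bp of a CDS encoding `n_residues` amino acids."""
    return 3 * (n_residues + (1 if include_stop else 0))


if __name__ == "__main__":
    # First 18 bases and last 15 bases of DPOGS204391-TA, copied from above.
    print(translate("ATGCCTCTTAGCCAGAAA"))
    print(translate("GAAGAAACACTGTAA"))
    print(expected_cds_length(1318))
```

Passing the full nucleotide sequence (with the line break removed) to `translate` should reproduce the DPOGS204391-PA record if the assumptions above hold.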