Monarch geneset OGS2.0

DPOGS209280
TranscriptDPOGS209280-TA2898 bp
ProteinDPOGS209280-PA965 aa
Genomic positionDPSCF300522 - 8148-17819
RNAseq coverage535x (Rank: top 24%)
Annotation
HeliconiusHMEL0176191e-3841.13% 
BombyxBGIBMGA001715-TA9e-5472.50% 
DrosophilaCG16952-PB2e-7535.36% 
EBI UniRef50UniRef50_Q7QD452e-7737.03%AGAP002951-PA n=10 Tax=Endopterygota RepID=Q7QD45_ANOGA
NCBI RefSeqXP_970886.13e-8237.85%PREDICTED: similar to BTB/POZ domain-containing protein 7 [Tribolium castaneum]
NCBI nr blastpgi|3454933375e-8832.35%PREDICTED: BTB/POZ domain-containing protein 7-like [Nasonia vitripennis]
NCBI nr blastxgi|910942833e-8036.32%PREDICTED: similar to BTB/POZ domain-containing protein 7 [Tribolium castaneum]
Group
Gene OntologyGO:00055154.8e-07protein binding
KEGG pathway 
InterPro domain[294-449] IPR0113339.2e-13BTB/POZ fold
[304-450] IPR0130694.8e-07BTB/POZ
Orthology groupMCL11220 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209280-TA
ATGTTACTGGGTTGTACTGAGCTGTCATGTTTGTATGATAAGGACGGGCGCGTGCTGGCCGATCGGTGCGGTGATGTCTTCCGTCCGATGAGCCGGCTCCTCTGTTGTCTGCTCGTCCGCCGCCCCGACATGGGCGCTGCGCTGTCGCACCCCCAGGAGGGCGAGGCATGTGACGCCGGCGTACCGCCCTCCCCACCCACCATGGCTGACGTCATCAGGGAGCGCAAGAAGAAAGCTGGTGGGGCTGGTCTGGGGACGTTGCGGAGGAGGTTGGCGGCGGCGACACGCAGACCGAGGGACTCGAGGCCTGATAGAGGTTGCGAACACGCTCGCTTCATCCGCTCCGTGGTGTCCACCTGGAGGTTGTCGGAGGTTTTCCTTCTGCGCGAAGAACTGGAAGCGGGCGCTGCGTTACGAGACCTGGCGACCCAGGCGGAACTGGCGCGCGAACCGGCCCCAGCCCTACACGCGGACCTGTTGGAACTGTTCCGGGAGAGGTGGTGGTGTGACGTGGAACTGGTCGGGAAGGGGTTCGCGCTCCCAGCACATAGAGTCATACTAGCTGCCAGGTGTTCATACTTCAGGGAGCTGCTGATGAGATATCCCAATTCGTGTCGTGTGCCGCTGGAGGGCGCTGGTGGGTCCCTGCCGCGGGAGGAGCTGGAGGCGGCCGTGGTAGCCATGTACGCGGGGCCCGCCGCGCTCAGGAGCGGCGCCAGATGTGACGCATGCTGCAAGTGGGAGCGAGGCTCGGAGGCGGAGCTTGACATCATCAGCGTGGAGAACTCAACCATCCGGAGGTGTACCTGCGGGCGCGGCAGGCGTATGGAGCCCTCGAGACTCCGCCGGCTGGCCGACCTGCTGGGCTTCACTCCGGAAGGACTGCACAGAGATATGAAGTATCTCCTGGACTCGGGTGAGTTGTGTGACGCTCGTCTGTCGTGGTCGGTGGAGGGTGAGGGTGCGGACGGTGACTGTGAAGGCGACGCGGAGGGTGACGGGGGTGCGTACGGCTTCAGGAGTGCTATGGAGCTGCCCTGCCACAGGCTGGTGCTGGCTGCCAGGTCTAGATTCTTCAGGAGTGTGATGTCTCGGCGCGGAGCGTGTTCAGGCACCGTGTGTGTTGATGAGCGTGTGCTGCCGAGGAGATTCGCCCGGGCGCTGCTGCACGCGGCCTACACGGACCAGGTGGACCTCTCTCTGATATCAAAGAGCAGTTCCAACGCAGCCTCCGGCTCGGGGACGCTACGCGGTTCTCGTGGCAGTTTAGAAGATGCGTTCCGTTTGTACGAGCTGGCCAGGTTCCTTGAGATGCCGATAGCTGCTCAGGGTTGTGAAGATGCTCTGGTCCGAGCGCTGTGTGCGGACACACTACCAGCCATTATACGGTGGAGTGGAGCCAAACACGCCTCCGCATGGGTCCATCGGCAAGCGACCCGCTACCTCCGAGATGAGTTCCCAGCTATCATGTCTCACTCGGGCTCCGGTCGTGTCCCTCGGGCTGCGCTGGCGGCCGCGCTCGCCTCACCCTTTCTGCAAGCGAGTGAGGCGCAGGCTCTGAGAGCTGTCATGAGGTGGGCGGAAAGGGCCGCCCCTGTACAGACGTGCAGTGAACCCAATATAGTCTGGCACACTAGGCGCGACGGGCGCTCCCGGCGTCGGCGGCGCGGGCCCGCGGACGACGCCCTCAGAGAGGCGGATAGTGTAATGATAGTTTACTGTGATGTATGTATGTCTACCTGCAACAGTCTTCGTCCACGTCGCTTCTCGAGGCGGCCGCCTGAGCCGACACCTCCCTGTACATACACCATGTTTGAATCATTATTCTCGACTGTTAATGTTAAGGATCTCACTAAGACCGCGAGTAGCACGTGTAGCACAGAGGGTGTCCGAGCGTGTGTCCCGTCCCGCGTGATGTGCGCTCTGAGGGCGAGGCTCAACGAGCTTAGGGCTGCGCCCCCCGCACAGAGAGCGCTGCGGCTGCATGCGGCTGATACCACGCCCGTCACCAGACAGCTGGCTCTGCGAGCTGTCCGGGAGCGATCTCTGCCGGACGCTGTCGCCGAGCTGCTCCTGAACGACGACGACGACAGGGAGGTGTCGGCTCAGGCGGCCGCCTCGAGGAGCGACGTGGACGAAGACTGTTGCAGGTCAAACTCCGGGTCGCTGAGATGCGGTTCCACAACCACACATACCTCTATAGAGAGCCGTCCGGGAACATACCGCAGGGAACTACCAGCCAGGCCGATAGAGACACATCACAGCGGCAGTTGGTCGGGAGGCCGTCTGTCAGCCGTGGTGCCGGACGTGGCGATGGCGCCGAACGCGAACACTCACCTCCTTACAGCGCGGGACTACCCGCAGATACACGCGCCCATACACACGCACGCGCACGCCGCGCCCAGGGACAACACCATCACCCTGCCCGAACTTGGAGTGCTGCAGCTGGACCTGGGCGATGGAGCAGCACACGCACCGAGACACGGGTCACGAGCACAGAGAGCGGCTATAGACGCGAGGCGGAGGGAAACACAGGGTGACCACGACGAACTGCGCGCAGCCATAGAGTTATCAGTTATAAGAGCGTACTCCTCGCTACAAGCTGCGAGGAGAAGTGGGGCGAATGGAACAGAGTTCTCTACCTCCAGACCCCGCGCTACTCCTAGTCCGTCTCTAGCTGTGGCCGCGGGCGGACCCAGGAGGAGAGAGAGACAACCGCCGCCGCGGCCGTGTCCGCCGCCCGTGGGGCGCAGTCCCAGCCCGCGACGGAGCCCCGCAGCTTACGGACAGAGGGACTATCGTGCCGAGGGCGCATATAGTCGTAGCCCGCGAAGCAGAGGACACAGCCCCGCGTACACACAGTCGGAGCACCCAACTGACTGGAGCCCTGGATCATAA

Protein sequence:

>DPOGS209280-PA
MLLGCTELSCLYDKDGRVLADRCGDVFRPMSRLLCCLLVRRPDMGAALSHPQEGEACDAGVPPSPPTMADVIRERKKKAGGAGLGTLRRRLAAATRRPRDSRPDRGCEHARFIRSVVSTWRLSEVFLLREELEAGAALRDLATQAELAREPAPALHADLLELFRERWWCDVELVGKGFALPAHRVILAARCSYFRELLMRYPNSCRVPLEGAGGSLPREELEAAVVAMYAGPAALRSGARCDACCKWERGSEAELDIISVENSTIRRCTCGRGRRMEPSRLRRLADLLGFTPEGLHRDMKYLLDSGELCDARLSWSVEGEGADGDCEGDAEGDGGAYGFRSAMELPCHRLVLAARSRFFRSVMSRRGACSGTVCVDERVLPRRFARALLHAAYTDQVDLSLISKSSSNAASGSGTLRGSRGSLEDAFRLYELARFLEMPIAAQGCEDALVRALCADTLPAIIRWSGAKHASAWVHRQATRYLRDEFPAIMSHSGSGRVPRAALAAALASPFLQASEAQALRAVMRWAERAAPVQTCSEPNIVWHTRRDGRSRRRRRGPADDALREADSVMIVYCDVCMSTCNSLRPRRFSRRPPEPTPPCTYTMFESLFSTVNVKDLTKTASSTCSTEGVRACVPSRVMCALRARLNELRAAPPAQRALRLHAADTTPVTRQLALRAVRERSLPDAVAELLLNDDDDREVSAQAAASRSDVDEDCCRSNSGSLRCGSTTTHTSIESRPGTYRRELPARPIETHHSGSWSGGRLSAVVPDVAMAPNANTHLLTARDYPQIHAPIHTHAHAAPRDNTITLPELGVLQLDLGDGAAHAPRHGSRAQRAAIDARRRETQGDHDELRAAIELSVIRAYSSLQAARRSGANGTEFSTSRPRATPSPSLAVAAGGPRRRERQPPPRPCPPPVGRSPSPRRSPAAYGQRDYRAEGAYSRSPRSRGHSPAYTQSEHPTDWSPGS-