Monarch geneset OGS2.0

DPOGS208661
TranscriptDPOGS208661-TA4032 bp
ProteinDPOGS208661-PA1343 aa
Genomic positionDPSCF300281 + 249917-259125
RNAseq coverage190x (Rank: top 48%)
Annotation
HeliconiusHMEL0117440.070.13% 
BombyxBGIBMGA007759-TA0.060.62% 
DrosophilaCG9727-PA3e-5448.17% 
EBI UniRef50UniRef50_F5HKS94e-5449.28%AGAP013226-PA n=1 Tax=Anopheles gambiae RepID=F5HKS9_ANOGA
NCBI RefSeqXP_001954622.15e-5548.62%GF16654 [Drosophila ananassae]
NCBI nr blastpgi|1947442791e-5348.62%GF16654 [Drosophila ananassae]
NCBI nr blastxgi|3071976421e-5648.12%Regulatory factor X domain-containing protein 2 [Harpegnathos saltator]
Group
Gene OntologyGO:00036773.4e-25DNA binding
GO:00063553.4e-25regulation of transcription, DNA-dependent
KEGG pathwayxtr:3950251e-31 
 K08061 (RFX5)maps-> Primary immunodeficiency
    Antigen processing and presentation
InterPro domain[151-225] IPR0119911.3e-27Winged helix-turn-helix transcription repressor DNA-binding
[146-216] IPR0031503.4e-25DNA-binding RFX
Orthology groupMCL21988 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208661-TA
ATGGATAGTTCAACTCCGTTTCAACCTTGGTCTAGTGACGATAATTCAAAGAATGTTAAATCTGAGCGAGAATTAGTAGATGACGCGAAACTTCATCGGGAAAATGTCAATCATCCAAAACATAGGCAAACTGCGTCTTACAATAAAGAAAGTGTCGTCGATCCTAGCGTCGTTCCTGGACCATCGAGTGCCACGGACACCGATGTCCATCCACTCGGCGGTAAGGAGTCAAAAATGGACTCCAGCAAAATAGCTTCCATGCAACAAATAGTCGAGAACACACTCAGTCAAGAGGGCCGTCAAAGAGTGTCACAATTGTTGGAAGCTGTAGAGGGTCTCAGTGGTGCGGAGAGATTGTTACTTTATCTCCGTCTACCAACCGGTGTACCTCCACACGATCCCCTCAAACAGCCAGTCAATCCGCTGGGCTCCAGAGCCGAACTACAGCAGACTGTAACGTGGATACAAACACACTTGGAGGTTGATCCTGACGTTTCGTTGCCAAAACAAGATGTTTACGATGAATACATAGCTCATTGTATGACCAGCAATATGAAACCACTATCGACCGCTGATTTCGGCAAAGTCATGAAGCAGGTGTATCCTAGTGTGCGCCCGCGCCGGTTGGGAACGCGCGGCAATTCAAGATACTGTTACGCTGGTCTCCGGAAGAAAGTTAAACTTGAAGTGCCACAGTTGCCAAATTTGGGTGAATCGACCAAGGAACCAAGTGTGCCTTCAAGAGAAAATGAAAGAATCATTTGTGATTGGGCTGAATCTAAATTGGGCGTTAAGTTCATGAACATATCGGAGTTATCTCGTCACCTGTTAAGCGCGATGCGCGCCCCGCCCGGTCCGACCAGCGCCACACCGCCGCCGCAAACGCATGGATCTGATGAACCACCGGGACCACAGTTAATGAAACAACAGTTAAAAAGAAAGCTACAGACGCAGGGTACGGTGGGTCGACCGAAAAAGAACAAGGGGCAAGAAGTCGCCGGGGAATCGCCGCCGTCCACTTCATACGTAAGCCATCCCACAGTGAAACATGAGAGCGAACTGGTGCCCGAATCGTACGGCTATCAGCCGGCTTACATGCCCGTTTATGACGTACGGCCGGCCTTTCCCTATGCGGCGCCACCTCGTGCTCACCCCCCCCATCCTCTTCCCCCACACCCTCATCCCCCACCACCGCACCCGCCCGCCATCCCAGTACACGATTACCGGCCCGATCCTTACGTCTTCGAACCGCCATATGCACCACGCGACCTTACTCTTCCGGACACGGCCCACAACGTACCGATTAACCTCAGCAGTGATGTTTCCCTCGACCTGTCCACCGAGCGCACAGAGTGGCAACGTCGCCGTCCCCCGGACCCTCCGGCTCCGACCGCCCGTCTTCCGCTGCCTGGGAAGAAGTTAATCTTGGAGACGTATCAGAGCGAGACTCAAGCTAGCTCCCCTCGCTCGTCACAAACTGACACGCGAGCCGTCCACCAGCCGCCAGAAGAATTTCCGCGCACGGAATACTTGCCTAAGAAGATGCGTGCCGCTGAGATACTGGGCGGTAAGTTGGCGGCACCGAGACAGGTCGCCGCTGCGGACGCGTCGACGTCATCATCGACAGGGGCGGAATCGGCACGTTCCGAGGTTGCTTTTTTAAAGGATCCAAAAAACTTAACGAACCGGTCCAAAAGCACCGCCGCGGCGCTGGCCCGGGAGGAACAGGAGGCGGCGAGTCCGACACGGTCGGTGAAGGCTCCGACGCCTAAACACAATCGAACTAAACTGAGAAAGTCGTCACACATGAGAACGAAGGGGTCGTCTCCGGAGAGGAACGAGCACGACCATTCCGCGGCGCCCGATACCTGTTGTGGTATTAATGGCATCAATATTATTACGGGAATGATCTGTGGGGAAAAACATTCTGAGATACTGAACCGTGAACGAGTTATTAGCATCTGTAATATCGATAAACACGATTTAGACGATTATCTCAATGAGGGCAATAGCCAGGAACACGAGGAAGAATTAATGCAATACTTCCATCATCGCGATGGTGACACAGATATACCAGCAAAATCTAATCAGAACGAAAATACGACTTTTTTAGAAACAAGTCAGGACCCACACGACGAACATAGTCAAGGGAAAAGTGAAAAAATATCGCAACTGCGAGAACTATTAGCTAAGAATTTGAAAAGTGGGTCCAATACACAGAATTTATTATTAAATCAAGAAAAACAAACTCCGATTAATAATAATCACGAAACACATGTTAATAAGAGTTCAATATCGGACGGAGCATTTAGGCCACTAAACAACGTTATGGAAATGATAAACGGCTCCAATGGCTCAAACGAAAATGAAAATAATGTAAGGAAAAAATGTGATAATATTCCAAGTGAAAACGGAATTTCCGCTTGTGCCCCAGAGAATTCACACACTCAGGTGATGACAACAGCGACTGGTTCTCATCTTTACAATGGTAACAGTCATACGGAACCACAAAGTCCAACTACTAGAACACAACAATACGATTTCGTACCTATATCCGATGGATGTCATTCTCCTGGGAATTTTAATTCAAAGTCTCCTTTAGGTTTTGGACAACGAGGGCACAGTCCAACTAAAACGAATAAACCTATTATAATGGGTGGTTCCTTGCAAACATCTCCTATTTCACATAGCATGGCAGCAAGTCCTTTTGTCAGTCCAAGAAATACTCCGGTGCCAAGGTCACGCTATTGTTCTAGACCAATACCTAAACACAACAATAGAAAGAGACGTACCATTTTGTCACTCGGTGTAAATGAGACTGGCACTAAACAATTCGCAATACCAAATGATCAGAAATTCCTGCCGAAGTCTTCTTTGGGTGGGCAGAATATAAAATATTGCCAACCAATGTCAGCACCTCCATCGCCCAACCTCCTACCACACTTTAAACAACAAATGATACAAAATTCCGTGATACCAAATGGCACCAGTAATATGTGCTTTGTGTCAAATACGTTTCGAGAGAATGTTAATGGTGAAATGCCACAACCATTATCAGCAGATCCGTTGTCCAGTGAAGTGAGTCAATTTTTTCAGGAGCCTGTAACGGGTTATAGAATAGCTCACGATACATCTTTTAGGTCACAATCGGTACCATTGAAACAGGCAACCATAAACATTGGCTTGTTGAGCTACAACAATACACCAGTCGGTTCCGTCCCACCTACACCAGTACCGAATGAGTTTTGCGATTTTGGTTCTTTAGCTGACACATGTGATATATCTAGGGCAGGACTCAATCCTGAGACTCTAGATAAAATATACGACGCTATTGATAGCAGCAATGACGTACTTAACGGTGGCACGAGTAATAACATATTAAACTCCAGCGACACCCTAACTAACGGTTGTGATCCTTTACCCGAACAACAAATTCTGATAGATGAACAGTTAAGTTCACTTATACCTAGCGGGGAAGAATTGCTCGATAGGAGCTCCCTTATGGCCAGCGGAGAAGACTTACTCGAAAGGGGTGAACAATTCAATGAGTCGTTTCCCAATGACTCGACGGCTTCAGAGAATGTGGAAGAGTTCTTGAAACGTACAAACAGCATAGAATTTGATTTATCTGATTTGGTAACAGAAAAAAATAAATACTACGCATCACGTTCCGTACCAAGTACGCCATTGCCTTATAAGCGAACGGCTTCGAACTTACAAATAGATCCACGGCACGCTAGAGATCTCTTCGCTACCGAAAATTATTCCAGTGTATCGAACGGTATATCTTCAAAATCCGTGCCATCTACACCACAATTAGCGGAAGATCGCAGTGTTTTTAGTTACACCAACAGAGACTTTCTCATTAACGGAAACTCGGTTGACATGTGCTCTAATCAGATTAGGCAACCGGTTGAAAATGACCAAGCCTTGACGTCCCCACTTGACGAAATACTAGGTCCTCTAACGCCAGCCGCAGATCTATTGGCTGACCTCGACAAAATAGATACTGCTCCATATGTTGACCTCTAG

Protein sequence:

>DPOGS208661-PA
MDSSTPFQPWSSDDNSKNVKSERELVDDAKLHRENVNHPKHRQTASYNKESVVDPSVVPGPSSATDTDVHPLGGKESKMDSSKIASMQQIVENTLSQEGRQRVSQLLEAVEGLSGAERLLLYLRLPTGVPPHDPLKQPVNPLGSRAELQQTVTWIQTHLEVDPDVSLPKQDVYDEYIAHCMTSNMKPLSTADFGKVMKQVYPSVRPRRLGTRGNSRYCYAGLRKKVKLEVPQLPNLGESTKEPSVPSRENERIICDWAESKLGVKFMNISELSRHLLSAMRAPPGPTSATPPPQTHGSDEPPGPQLMKQQLKRKLQTQGTVGRPKKNKGQEVAGESPPSTSYVSHPTVKHESELVPESYGYQPAYMPVYDVRPAFPYAAPPRAHPPHPLPPHPHPPPPHPPAIPVHDYRPDPYVFEPPYAPRDLTLPDTAHNVPINLSSDVSLDLSTERTEWQRRRPPDPPAPTARLPLPGKKLILETYQSETQASSPRSSQTDTRAVHQPPEEFPRTEYLPKKMRAAEILGGKLAAPRQVAAADASTSSSTGAESARSEVAFLKDPKNLTNRSKSTAAALAREEQEAASPTRSVKAPTPKHNRTKLRKSSHMRTKGSSPERNEHDHSAAPDTCCGINGINIITGMICGEKHSEILNRERVISICNIDKHDLDDYLNEGNSQEHEEELMQYFHHRDGDTDIPAKSNQNENTTFLETSQDPHDEHSQGKSEKISQLRELLAKNLKSGSNTQNLLLNQEKQTPINNNHETHVNKSSISDGAFRPLNNVMEMINGSNGSNENENNVRKKCDNIPSENGISACAPENSHTQVMTTATGSHLYNGNSHTEPQSPTTRTQQYDFVPISDGCHSPGNFNSKSPLGFGQRGHSPTKTNKPIIMGGSLQTSPISHSMAASPFVSPRNTPVPRSRYCSRPIPKHNNRKRRTILSLGVNETGTKQFAIPNDQKFLPKSSLGGQNIKYCQPMSAPPSPNLLPHFKQQMIQNSVIPNGTSNMCFVSNTFRENVNGEMPQPLSADPLSSEVSQFFQEPVTGYRIAHDTSFRSQSVPLKQATINIGLLSYNNTPVGSVPPTPVPNEFCDFGSLADTCDISRAGLNPETLDKIYDAIDSSNDVLNGGTSNNILNSSDTLTNGCDPLPEQQILIDEQLSSLIPSGEELLDRSSLMASGEDLLERGEQFNESFPNDSTASENVEEFLKRTNSIEFDLSDLVTEKNKYYASRSVPSTPLPYKRTASNLQIDPRHARDLFATENYSSVSNGISSKSVPSTPQLAEDRSVFSYTNRDFLINGNSVDMCSNQIRQPVENDQALTSPLDEILGPLTPAADLLADLDKIDTAPYVDL-