Monarch geneset OGS2.0

DPOGS214667
Transcript  DPOGS214667-TA  2286 bp
Protein  DPOGS214667-PA  761 aa
Genomic position  DPSCF300321 + 983-11812
RNAseq coverage  96x (Rank: top 62%)
Annotation
Heliconius  HMEL003601  0.0  85.98%
Bombyx  BGIBMGA001942-TA  0.0  85.62%
Drosophila  CG9313-PA  0.0  46.70%
EBI UniRef50  UniRef50_E2BE13  0.0  47.51%  Dynein intermediate chain 2, ciliary n=5 Tax=Formicidae RepID=E2BE13_HARSA
NCBI RefSeq  XP_314788.4  0.0  47.69%  AGAP008689-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|270009429  0.0  48.44%  hypothetical protein TcasGA2_TC008686 [Tribolium castaneum]
NCBI nr blastx  gi|270009429  0.0  49.25%  hypothetical protein TcasGA2_TC008686 [Tribolium castaneum]
Group
Gene Ontology  GO:0005515  3.1e-37  protein binding
KEGG pathway  aga:AgaP_AGAP008689  0.0
    K10409 (DNAI1)  maps-> Huntington's disease
InterPro domain  [381-713]  IPR011046  3.1e-37  WD40 repeat-like-containing domain
    [354-710]  IPR015943  1.6e-33  WD40/YVTN repeat-like-containing domain
Orthology group  MCL12246  Single-copy universal gene

Nucleotide sequence:

>DPOGS214667-TA
ATGGCGCCAAAGAAGAAAATGCTACGAGCTATGGTCAAAAGAGCTTTGAAGAGAGACGATGACACCGTGCTTATTACTAAGACCCTGCTGCGTTTGGACCAAAAAGAATCATCAAAATCTCGTTCCAATCTTGCTGATGACGAAGCTTTGGAATGGTTGAAACCTAGAACTTTATTGAAGCCAGACGAACAAGAGGATGTTCCAGACGCTGAACTAGATGAAGAAGTTGGACGTTTTTTAACAACAATAAACCCACAATGGGTGGCAGATGAAGTTGCATACCATTATGGCCAAGGAAAATTTGTAAAGAAGCCTAAACCGTCCTTCGGCAAAGCATATCCTCTTTATATTTACCAAGGGACGGCTATAAGGAAGGATTCAGAGGAGGCGAAAGCACAAATCGCTGCTGGATACGGCATGGTCGGATTTGATGACGCTGGTCCAAAGAAAGCTAAGGTTAAGAAATCAGCGGCTGAAGAAGAAGAAGAAGAAGAACCAGAACCCGAGCCGCCTCCACCCGAAGAAACTAAACATGAAGAGGAAGTACAAGAGGAGGAGAAACCAACTGAAACTGTAGAAGAACAAGAAGCACATCCAATTGAAGAAGAAAATGAAGAGGCAGCAGAAGCTGAAGATGTTACAGAAGAAGCAACACCAGAAGAAAAAGCCGGGGGCCACGAAGAAGAAGTTATAGCGATAAAGCCGAGGGTAAAGAAACTCGCGAATCAGTTCAATTTCTGTGAAAGGGCTGCACTCACTTATAATTTACCGCGAAGGTCCGTCGAAACGCAAACTATCCCACCGCCACGTGCAACATACGGCGCTACAGTACTCCAGTGCATTATATTTGACTTCTATCAGGAAGACTACGCTAGGAAACAGAGGGAAAAGGAAGAGGAGAAACCAAAAAGACTGCGTAAAAAAGAAAGAAAACACGCGAAATCCGCGCAGCAAATCCACGACGAACAACTCGCCCAGAGGATAAGAGAGGCTTGGACGATCCTCGAGAGATTGGTTAATCAAAATATTTATGATGACATCGCCCAAGACTATCGCTATTGGGACGATCCCTCGGACGAGTTCAGGGAAGGTATGGGATCTTTGCTGCCACTGTGGAAGTTCCAATTCGAACCTATGAGGAGTCATGCTGTCTGCGACGTGCAATGGAACCCTCACTATCAGGACCTGTTCGCTGTAGCATACGGATCCTTGGATTTCACCCAGCAACAAAAGCAGGGATGTCTCTGTCTATACAGCATCAAGAACCCAGCGTATCCCGAGTACGCTGTCATCACAGAGTCACCAGTTATTTGCCTCGATGTCTTCAAAGAAACGCCATATCTTATTTGCGTCGGTCTTTACGACGGAAATGTATGTGTGTATAACGCCCAACTCACACTAGAATCCTCGTATCAGTATAAATCAGATTCAGTGAGAGACAAACACAGTAACATTGTTTGGGAGATTCGATGGGGCCCTCGTCTCATAGACGGCGAGGCGTCATTCTTCTCTATATCCGGGGACGGTCGAGTGGTCCAGTGGGCGATCATGCCGGGCGAGCTGCAAGCTACCACTATCATCACACTGCGAACAGACCTACCACCACTGCCGGGACCTGACGGGACCTTGCTCACCGTCAACAGTTGCGGCAGCTGCATATGTTTCCACCCGGAGAAGGCGGACATATTCCTTGTTGGTACAGAGGACGGTATGATATACACGTGCTCGCTGAAATACAACAGGAACTATGTGCGTTCGGTTCAAGGACACCACATGCCGGTTTACAGAATACATTACAATTACTACAATAACAGCATATACGCCTCCTGTTCAGGGGACTGGAGGGTGAAGATATGGGAGGACGGGCGGGACGAGCCCCTGTTCATGTTCGAGCTTGGCTCGCCGGTAGGTGACGTGAAGTGGGCTCCGTACTCCAGCACCGTGTTCGCGGCCTGTACAGCTGACGGAAAGGTTTACGTTTATGATCTTAATGTTAACAA
ATACCGTCCGATCTGCGTCCAAGCTGTTGTATCAAAGAAAACAAAGAAACTCACGAGGATCGACTTCAACCCAATTTTGCCGATCGTGGTGTGTGGAGACACCAAAGGTACTTGTCACGTTGTAAAATTGTCGCCAAATTTACGTGTCATGTGTAAACCACCGAAAAAGGCGCAGGGCATAGATCAAAGGACTTTACAGATAATGAAACTAGACAAACTGTTAACTTTAGTCCGGGACCCACCGTTTCAGACCGGGGTTGTCGATGAAAAGTTTGATGATGATTGA

Protein sequence:

>DPOGS214667-PA
MAPKKKMLRAMVKRALKRDDDTVLITKTLLRLDQKESSKSRSNLADDEALEWLKPRTLLKPDEQEDVPDAELDEEVGRFLTTINPQWVADEVAYHYGQGKFVKKPKPSFGKAYPLYIYQGTAIRKDSEEAKAQIAAGYGMVGFDDAGPKKAKVKKSAAEEEEEEEPEPEPPPPEETKHEEEVQEEEKPTETVEEQEAHPIEEENEEAAEAEDVTEEATPEEKAGGHEEEVIAIKPRVKKLANQFNFCERAALTYNLPRRSVETQTIPPPRATYGATVLQCIIFDFYQEDYARKQREKEEEKPKRLRKKERKHAKSAQQIHDEQLAQRIREAWTILERLVNQNIYDDIAQDYRYWDDPSDEFREGMGSLLPLWKFQFEPMRSHAVCDVQWNPHYQDLFAVAYGSLDFTQQQKQGCLCLYSIKNPAYPEYAVITESPVICLDVFKETPYLICVGLYDGNVCVYNAQLTLESSYQYKSDSVRDKHSNIVWEIRWGPRLIDGEASFFSISGDGRVVQWAIMPGELQATTIITLRTDLPPLPGPDGTLLTVNSCGSCICFHPEKADIFLVGTEDGMIYTCSLKYNRNYVRSVQGHHMPVYRIHYNYYNNSIYASCSGDWRVKIWEDGRDEPLFMFELGSPVGDVKWAPYSSTVFAACTADGKVYVYDLNVNKYRPICVQAVVSKKTKKLTRIDFNPILPIVVCGDTKGTCHVVKLSPNLRVMCKPPKKAQGIDQRTLQIMKLDKLLTLVRDPPFQTGVVDEKFDDD-
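
The lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and makes two assumptions that the record does not state: that the trailing "-" in the protein sequence marks the stop codon (the nucleotide sequence ends in TGA), and that the genomic coordinates 983-11812 are 1-based and inclusive. The function names are hypothetical helpers, not part of any database tool.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding-sequence length in bp implied by a protein length in residues."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 2286
protein_aa = 761

# 761 residues plus one stop codon, three bases each.
print(expected_cds_length(protein_aa) == transcript_bp)

# Span of the gene model on scaffold DPSCF300321 under the stated assumption.
print(genomic_span(983, 11812))
```

Under these assumptions the transcript length matches the protein length exactly, and the genomic span is considerably longer than the transcript.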