Monarch geneset OGS2.0

DPOGS205570
TranscriptDPOGS205570-TA3387 bp
ProteinDPOGS205570-PA1128 aa
Genomic positionDPSCF300099 + 361494-365104
RNAseq coverage6x (Rank: top 87%)
Annotation
HeliconiusHMEL0086480.080.13% 
BombyxBGIBMGA003083-TA0.071.48% 
DrosophilaCG14838-PA5e-9027.56% 
EBI UniRef50UniRef50_UPI000224786B2e-12027.45%UPI000224786B related cluster n=1 Tax=unknown RepID=UPI000224786B
NCBI RefSeqXP_001599307.15e-13128.16%PREDICTED: similar to testis development protein nyd-sp29 [Nasonia vitripennis]
NCBI nr blastpgi|3454926918e-12027.45%PREDICTED: WD repeat-containing protein 63-like [Nasonia vitripennis]
NCBI nr blastxgi|1953757803e-10827.52%GJ13015 [Drosophila virilis]
Group
Gene OntologyGO:00055152e-17protein binding
KEGG pathwayaga:AgaP_AGAP0115393e-11 
 K11143 (DNAI2)maps-> Huntington's disease
InterPro domain[380-812] IPR0110462e-17WD40 repeat-like-containing domain
[720-858] IPR0159431.5e-13WD40/YVTN repeat-like-containing domain
Orthology groupMCL11542 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205570-TA
ATGAGCGATTCTGATAATTCTCAGGATGAAGTTAAAATTTGTTGTAAAAATGTTCAGAGCAATACATCTGATTCTAGCTCGGACGACGATTTTGAAATTCAGAGAAGAGATCAATTTTCGGGTGAGAATATGACATCTGCACAACCATTGCCCAGAAGAAAAAGATTCATACCTAAGTACGACGTAGAAGGTGTTTACAAAGTGATACTTGGAGAAGAAACGCAAATACAAATAGATATGGTGATTGGAACTCATATTAATGCTCAATATCCCTGGAAGTTAGTAAGTAAAGATAAAATTTTGGAAACAATTGATGTAGGTGGTGAATTATCTGAGTTTTTTCCAATAAAAGACGTCTTACATAAATATCCAGAAGCAGATGTGCTTATGGGGTATATTGTGGATGAGTCTACTGACGTCGATGAATTTTATGTTTGTTGTACTACAGAAGCTAGAGATCATTGTAAACAACAATCTGACAGATTTATTAAAAGGCAAGAAAAAAAATTAGAAAAGGCCGTCCAAAAAAAGCCAAGACCGTGGAAAAGTCTTGGAAGTGAAACTGAAATAATTGAGTTATTACCTATTAATACAAGACCTTTATTAGAAATAGAAATAAAGGCCAAATTTCCTAAAGAATATCAACCTCAAAAATTTACAGTAAGAGACACAAGTCAGGTACGCGATGGTTTTATAGAACTAACGCCATACAGACAAAAATTTAATAATGTCTTCCGACGTAGAGTTGATTGCGCGGTTCAGGCTGCTCCACCGAAAGTTCACAATGTTGCCCAAACCGTGCTTAGATACCCTCAGAATATGTGGACCCAATCAATAGCTGATGCAATGGGTGCTATAGAGGATACCGAAGCTGGTGGGGATACAGCTGTTGAAAAAACAGATGAGATTCAGGAAGCGGTGAAAAGTTCAGACGAAGAAGAGGAAGAGGCAGTTCAGGCACCTGTAGATGAAGTTGGTTTGTTAGCTTTTAAAAATGCATTACAACGCATGCGCCGTGCTTCGTATGTAAAAAAAATTAATAATTTCATGCAAAATAAACAAAGTGAAATGGATTCTGTAATAGAACTCAATACTGTCATGGATATGTACTGTAATGACTACCCGAATTTAGTCGCTAAAAAAACTGTTGATATATATGACACAATGACTTTTGAGGAATACGTATGTTTTACCGACGTTAGAGCTAAAGATAAATATATTTCTTCTGCCGTTTTCCATCCAATGTGGTCTGGTATAGTTGCCATTTGTTACTCAGATGGATCCCCAACAGTGATGAAAACACTTAGCTCGCGTCCGGATCCCATTCAAAGAGCCGTCTATGGTCTAAATCCTGTATTAATATGGAGTCATATAGATAGTTTATTACCCAAGCTATACCTAGAATCCCCAAGAGAAGTAAAAGTATTGTCTTTTTGTCCATTTGATGAGAATATTTTAATTGGTGGCTGTGTTAATGGACAAATAGTTATATGGGATATACAAAATAAATTAGAAAATGTTGAGAAAATAGAAGTTCTTAGTGAAAAAAGAGAGAAATATAAAATAGCCATGAATGCTCACATGGGCTGGATGAAAAATGTACATGATGATACAATAGTTCAATCCACAGTCTTAAGTAATTTGATGACGAGTCACTATGGACCTGTGACATCAATTAATTGGCTATCTCCAAATTTTTCTGTAACACCCACTGGAAAAACTCACTTGATAACTGATAATAAAAAATCTCTTATATTTTTTACGGGATCTGAAGACGGCTTAATACTAATTTGGAATTTATCTGTTGAAAATGTTACACTTTTTGAAGGAAAAAAAGTGAAGAAATCTAAGCGAGTACTTAAAAGACCATCTGGTCTATTGGTTGATTTTTCACCATTTAAAATTTTGGATCGAAACTTACAACCTTGTTATAAAATTATTTTAGGAGTTGCCGGTCAACCTCAATCTTTACCACTACAAAGTTTTGGAATGAATTCCCCTGTAATAAAATATACCTATGTACCAAAGAATACTGGGACTGGAAGAAAATATTACACGTCTGAAATTATACCTCAAGGAGAATTTGAAATAAATACAATCTTGTACTGTGGATCTCAGCACGGCGAATTGGCTAGAATTACGTGGGAAGGTCATGAATTCAATACTGGTGAAATAGTTAACTCGGAGTATTGTGATATAGCATTTAGTTGTCACATACATGATGGTATTATATCCCGTTGTCAGAAAAATCCCTTTATTAATAATATAACATTAACTGTTGGAGGTAAAATTTTTGCTATTTGGTCTGATAAACTAAAAGATAGACCTTTAATATGGAAAAAGCGTCCATCAAGGCTTACTGATTGTGCTTGGTCCTTATATAAACCGAGCTTACTATTTATTACTACTTCTGAGGGTGATTTAGAAACTTGGGACTTACTACTTAGGAGCGATCTACCTATAGCCGTTCAAACTTTATCAGGAAATATGCTTACTAGTGTAAGTTTACACACATTGCCAATAGCTAAAAATATTATTGGAGTAAGTGACATAAATGGATCATTCAGGATGTTTTTATATCCACCAATATTTATGATTGAAAATCCTACTTATATAGGTCGAATGGAAAGCATGATAATACGGGAACTTAAAGTATTGAAAACTTTTATTTCGTGGCAAGAAGAATGGATGCGTAACAATCCAGAAATTTTATTAGAAATAAGAAGAAAAGAGGGAGAACTGATGGCTATTAAAGAAGAAGAGAAACGAAAGTTAAAAGAAGAGGAGGAAAAACAACTAGAAGAAGAAGCAGAAGCACGTCGACAGCAAAAATTAAAAGTATTAGGTCCAGAAGAAAGATGGCAAAAGATTATTCAAAAGCTAATAGAAAAAACTATAGCCGTTAAGAAGAGAATAAACAGAGCAGAACTTATTGAACATGAAAAACCCCTTAGAGAATTAGAAGCACAAAGGTTAGAAAAAGAACGCAGAATGCTTGAAATTATGAAAAATCAAAAAACTATTTTCAATGACACCGTAGCCATATTATTTCCTGAAGCTATTAAAAAAGAAGTTAAAGTAAAAAAAAGCTACCTAGGCGATGACAAGAGAACATTAAAGAGATCTTATTTAGAAAATTACGACAATTTAAAGAATACAGCGAATATAACAGTCAGTAGACATCCATATAAAATAGATTTTTCTTGGGAAAATACATTAGCTGAAGGCAAAGAAAGGCGACAAGCATTGAACGCCCATGATGATTTTATAAAAATACATAAAGGAAGAATAGAAGAAGAAGGCAAAGAGACCCTTACATTACCATCGGTCTTAAGCACAGCTGTAACTGAGGAAGATGAAAATCTTGAAAATGAGGAAGATGATTGA

Protein sequence:

>DPOGS205570-PA
MSDSDNSQDEVKICCKNVQSNTSDSSSDDDFEIQRRDQFSGENMTSAQPLPRRKRFIPKYDVEGVYKVILGEETQIQIDMVIGTHINAQYPWKLVSKDKILETIDVGGELSEFFPIKDVLHKYPEADVLMGYIVDESTDVDEFYVCCTTEARDHCKQQSDRFIKRQEKKLEKAVQKKPRPWKSLGSETEIIELLPINTRPLLEIEIKAKFPKEYQPQKFTVRDTSQVRDGFIELTPYRQKFNNVFRRRVDCAVQAAPPKVHNVAQTVLRYPQNMWTQSIADAMGAIEDTEAGGDTAVEKTDEIQEAVKSSDEEEEEAVQAPVDEVGLLAFKNALQRMRRASYVKKINNFMQNKQSEMDSVIELNTVMDMYCNDYPNLVAKKTVDIYDTMTFEEYVCFTDVRAKDKYISSAVFHPMWSGIVAICYSDGSPTVMKTLSSRPDPIQRAVYGLNPVLIWSHIDSLLPKLYLESPREVKVLSFCPFDENILIGGCVNGQIVIWDIQNKLENVEKIEVLSEKREKYKIAMNAHMGWMKNVHDDTIVQSTVLSNLMTSHYGPVTSINWLSPNFSVTPTGKTHLITDNKKSLIFFTGSEDGLILIWNLSVENVTLFEGKKVKKSKRVLKRPSGLLVDFSPFKILDRNLQPCYKIILGVAGQPQSLPLQSFGMNSPVIKYTYVPKNTGTGRKYYTSEIIPQGEFEINTILYCGSQHGELARITWEGHEFNTGEIVNSEYCDIAFSCHIHDGIISRCQKNPFINNITLTVGGKIFAIWSDKLKDRPLIWKKRPSRLTDCAWSLYKPSLLFITTSEGDLETWDLLLRSDLPIAVQTLSGNMLTSVSLHTLPIAKNIIGVSDINGSFRMFLYPPIFMIENPTYIGRMESMIIRELKVLKTFISWQEEWMRNNPEILLEIRRKEGELMAIKEEEKRKLKEEEEKQLEEEAEARRQQKLKVLGPEERWQKIIQKLIEKTIAVKKRINRAELIEHEKPLRELEAQRLEKERRMLEIMKNQKTIFNDTVAILFPEAIKKEVKVKKSYLGDDKRTLKRSYLENYDNLKNTANITVSRHPYKIDFSWENTLAEGKERRQALNAHDDFIKIHKGRIEEEGKETLTLPSVLSTAVTEEDENLENEEDD-