Monarch geneset OGS2.0

DPOGS201493
Transcript        DPOGS201493-TA  1386 bp
Protein           DPOGS201493-PA  461 aa
Genomic position  DPSCF300006 + 418272-432082
RNAseq coverage   308x (Rank: top 37%)
Annotation
Heliconius      HMEL015964        0.0     84.88%
Bombyx          BGIBMGA002680-TA  2e-160  78.45%
Drosophila      Rcd5-PA           1e-132  49.90%
EBI UniRef50    UniRef50_E0VAR7   6e-142  58.78%  Microspherule protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0VAR7_PEDHC
NCBI RefSeq     XP_624688.2       1e-150  57.93%  PREDICTED: similar to Microspherule protein 1 (58 kDa microspherule protein) (Cell cycle-regulated factor p78) (MCRS2) [Apis mellifera]
NCBI nr blastp  gi|322801274      2e-151  62.05%  hypothetical protein SINV_06956 [Solenopsis invicta]
NCBI nr blastx  gi|322801274      2e-144  61.73%  hypothetical protein SINV_06956 [Solenopsis invicta]
Group
Gene Ontology    GO:0005515  4.8e-14  protein binding
KEGG pathway
InterPro domain  [340-449]  IPR008984  4.8e-14  SMAD/FHA domain
                 [347-447]  IPR000253  4.4e-08  Forkhead-associated (FHA) domain
Orthology group  MCL12127  Single-copy universal gene

Nucleotide sequence:

>DPOGS201493-TA
ATGGATTTTCCTGCAACGCCATCAAATGCATCGGATAGTAACGATTATCCCGGTGCCACTCCATCGCAGTTTGATACAGCACAAGATAGTACAAAAAGACGGAGTTCGTCTAGATCAATAAAACGTCGGCGTTTCGATGATGAACTGGTAGAATATAGTTTAGGTTTACCAGGTGCTTCAATTGGTAAAAATGAAAAACGAGTTCGGACACAGTCGTATACAAGTCAATTGCCAGAAGCAGTGGTTGTTGGAGCTACTCCTGTGACACCAGCTGCTCCTACTCAGGAGCGAAGACGAGCTTCAAGTAAAATATCAGGTGCAGGAAGCATTGGTCGCAAGTCTCGCCGTAAAGGACAACTAAGTCAGCTATCTACAAAAGATTTAGGAAGATGGAAACCCACAGACGATTTAGCATTGATTTTGGGAGTACAACAAACAAATGATCTGCGTATAGTTCATCGTGGCGTAAAGTTCTCTTGCCGTTTCACAGTAGGTGAACTGCAGTCTCGGTGGTATGCACTATTGTACAATGCAGAAGTATCTCGTGTTGCTCTTGCAGCAATGCGCAACTTGCATCCCGACCTTGTTGCTGCTGTTCAACAGCAAGCACTGTATTCTAATGCTGAGGAAGAGCTGCTTGGAACTTTGCCTAGTAATTCACATCCAGCCATTGAGAAATTCCAAGAGTTATTGGAGGCTAACCCGCACATATTCTATCCGACGCGCACTGCCAAGTCTCTAATGAACCATTGGCAGCTGTTGAAGCAATATCAGCTTTTACCTGATCAGACCGTACAGCCTTTACCTGTTAAAGGTCAGACTGACAATATAATGACATTCTCTGATGCTGAAGAAACCATGAATGACTCGGAATTACCTGATTATAAAGAGGATGGGATTGATATTGAAATGCAACTCGCCGACAGGGTTGAAAAGAAGGATATCCGGTTGCTAGAGAACTGTATGTCCCGCTGGCAGGTTCTAGTGCAGTCGGTGGCTGGTGGCAGCGCTGAGCTCGATAAGAATACTCTGGCTGTGCTGAGAGGACGACTAGTCAGATACCTCATGAGATCCAGAGAGATCGCCGTGGGTAGGAGTACTAGAGACCACACCATCGACGTAGATTTGAGCCTCGAAGGTCCAGCAGCGAAGGTCTCAAGGAAACAGGCGACCATTCGTCTCAGGAACAGCGGTGATTTCTTCATGTCTTCAGAGGGCAAGCGGCCCATTTTCGTTGACGGGCGACCCGTGCTTCAGGGGAACAAAGTTAAGCTGAACCACAACACGGTTATTGAAATCGCCGGTCTACGCTTCGTGTTTCTCATAAACCAGGACCTGATCAATGCCATAACACAAGAAGCCGTCAAAGTTACAATACCAGTCTGA

Protein sequence:

>DPOGS201493-PA
MDFPATPSNASDSNDYPGATPSQFDTAQDSTKRRSSSRSIKRRRFDDELVEYSLGLPGASIGKNEKRVRTQSYTSQLPEAVVVGATPVTPAAPTQERRRASSKISGAGSIGRKSRRKGQLSQLSTKDLGRWKPTDDLALILGVQQTNDLRIVHRGVKFSCRFTVGELQSRWYALLYNAEVSRVALAAMRNLHPDLVAAVQQQALYSNAEEELLGTLPSNSHPAIEKFQELLEANPHIFYPTRTAKSLMNHWQLLKQYQLLPDQTVQPLPVKGQTDNIMTFSDAEETMNDSELPDYKEDGIDIEMQLADRVEKKDIRLLENCMSRWQVLVQSVAGGSAELDKNTLAVLRGRLVRYLMRSREIAVGRSTRDHTIDVDLSLEGPAAKVSRKQATIRLRNSGDFFMSSEGKRPIFVDGRPVLQGNKVKLNHNTVIEIAGLRFVFLINQDLINAITQEAVKVTIPV-
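
As a consistency check on the two sequences above, the transcript can be translated with the standard genetic code and compared with the protein record. The following is a minimal Python sketch and is not part of the database record. The function name `translate` and the choice of `-` as the stop symbol (mirroring the trailing `-` in the protein sequence above) are assumptions made for illustration.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered
# by base T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(nucleotides, stop_symbol="-"):
    """Translate a coding sequence codon by codon, starting at position 0.

    Hypothetical helper: stop codons are written as `stop_symbol`, and any
    trailing bases that do not form a full codon are ignored.
    """
    seq = nucleotides.upper().replace("U", "T")
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        protein.append(stop_symbol if residue == "*" else residue)
    return "".join(protein)


# First seven codons and last three codons of DPOGS201493-TA, copied from
# the nucleotide sequence above.
print(translate("ATGGATTTTCCTGCAACGCCA"))  # MDFPATP
print(translate("CCAGTCTGA"))              # PV-

# The reported lengths agree: 461 residues plus one stop codon, times 3 bases.
print(3 * (461 + 1) == 1386)               # True
```

Passing the full DPOGS201493-TA sequence to `translate` and comparing the result with the DPOGS201493-PA record extends the same check to the whole gene model.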