Monarch geneset OGS2.0

DPOGS207525
TranscriptDPOGS207525-TA2136 bp
ProteinDPOGS207525-PA711 aa
Genomic positionDPSCF300177 + 86248-92611
RNAseq coverage338x (Rank: top 34%)
Annotation
HeliconiusHMEL0055150.077.61% 
BombyxBGIBMGA001887-TA0.070.41% 
DrosophilaMad1-PA3e-6729.01% 
EBI UniRef50UniRef50_E2AIN42e-6931.13%Mitotic spindle assembly checkpoint protein MAD1 n=9 Tax=Formicidae RepID=E2AIN4_CAMFO
NCBI RefSeqXP_968537.21e-8432.08%PREDICTED: similar to Rs1 CG2173-PA [Tribolium castaneum]
NCBI nr blastpgi|2700100442e-8332.02%hypothetical protein TcasGA2_TC009389 [Tribolium castaneum]
NCBI nr blastxgi|1892387019e-9732.02%PREDICTED: similar to Rs1 CG2173-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[46-705] IPR0086723.5e-74Mitotic checkpoint
Orthology groupMCL13322 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207525-TA
ATGGCTAAAGAAGGAGACATGTCGCTGTTTAACGATGTGTTGGAACCGTTTAGGCGGGTTATAAACACCGACTTGCCAAAGGATAAGTTGTCGGCGTCTGCTAAATTAAATTTTTCTGATAGTAATCAATCTATCAAAGAAGGTTCTACAACTTTATTTTCGTCTATCAAGAGGAAATCTGGCTTAGAAAATCCAACTCCTGAAAAACGAATACGTACAGAATCAGGATCGAATTCTAATGGTCCACCTCCATCTCCATGGGAAGCTAAAAGACTGAAAATTGATTTGATAGCAGCCAAAGCACAGATAACCAAGCTAGAAGCTCGTGTTAACCATCAGCACACAATACGTAAGGAAATGCAAATACTGTTTGAGGAGGAGAAAGCCTCTTTGATAGAACAACACAAACACGATGAAAGGGCTCTGTCGGATATGGAAGATCGTCTCAAGTTGATTAGACGAAGGGAACTTGAAATTAAAGAAGAATATAACACGGCAACTAAAGAACACAAAGAGGCTAAGCTGAAATGGGAAAAAGATAGGAGTGATCTTCAAAAACAGATAGCCGAGTTAAAAGACAAGTTGCTAGAATCTGATGTGAGCAGTAAGGATCAGTTGTCGGAGATGAAGAGAGATATGGACGAATTGTTACAGGCTCTAGAAGGAGCACAACAGGAGGTTCAAATTCTGAAAAAGGAAGTATCAAAACAAACGGCCAGGGCTGAACAATGTTCAACTCTCCGTTCACAGTTAGAGAAACAGACATTTGAATTGCAACAAATTACTAACAAATTGAAAGAACTGGAGTACGAGAAAGACTCCTATAAGGATTGGCAGCAACAAGTTAAGACGGCCCAGAAACGTCTTACAAATATGGCAGAGTTGGAGAAGGAAGTAGGGAGATTACGAGCAGCTGAGCGATCACTCAGAGACTCAGTGAGTAACAAGTTACTGCTTGAAGAACAAGTCCATATACTGAACACCAAGGTGGAGACATTACAACCAGTCCAGCAGGAATTACATGACGCTAAGGTTAAAATAGCATCACTGGAGTCTACTCTGGAAGAGTGGAAGAATGTTGCTCGTACACATGGCATAGAGAATGCCAGGTCGCTATCATCAGCTCTGGACTCGGCTCTCAGCAGTCAACTGACCGCTGTGGCTGGATGTTCACAAGCACAGTCACAGTCTGCTCAGCTCAGTGAGGAAATTGCTACAGTCAAGTTCGAACGCGATAAGGCGACTACGAAGCTGAACGACCTGCAAACAGTTCGTAAGAACCAGGAGAGTCTGATACACCGGCTGCAGAAACGACTTCTGCTGGTTACGAGAGAGAGAGACAGCTACAGACAACAGCTGGACTGTTACGAGAAGGAACTGACGGTGTCTATAAGCGGAGCGGGCGAGTCCGCGGGCGGGGCCGCACTGCTGGCGGCGAGGGTCGAACAGCTAGAGAAAGCGTTGCAATCCTACCGCGACCTGCTAGCGTTACACGACCACGATCCAAACGCTAAGGCCCTGGAATTGGCTCGAGCTGAAACATCTAAGTACCGGGAAGAAGCGGAGGCCGCTAAACGGGAAGTCGCCAAGCTTAGAGCCCAAAGAGACCAACTGCAGATGCATTTTGAAAAACTTGCTGCGCCTACAAAAATACTACACTTAGCTGACAACCCTGCAGCGGCCGCTCAGAAACAGATGCAACTTGAATTGGAGTCGGCGCAGGAAGAGATAAAGAAGCTGAAGGCTGCACTTCGCGACGGAGGGTCCAGCGCTGGTGATGACGCGCTCCGAGCACAGCTTGAAAATAGTCGCATCAAGTTACAGAGGATGAAAGAGGAATTTACATCTTCAGCCCAAGAGTATCGTGATGTGTGTTACATGCTGCTAGGCTACAAAATAGATAGAACTGGACATAAGAACTATAGGATATCAAACATGTACGCTGAGTCCTCAGATGAGTATCTGACATTCACGCTGTCTGATGATGGCATTGAAATGGTACATACGGATTATTCAACAACATTGGGAGACTTGGTCGAGTTGCACTTGCATCAGAACAGATCGATACCCGTGTTCCTCAGTGCTCTGACCATGGAACTGTTCACCAGAATAACTATGCAGCAGACGCAATGCTAG

Protein sequence:

>DPOGS207525-PA
MAKEGDMSLFNDVLEPFRRVINTDLPKDKLSASAKLNFSDSNQSIKEGSTTLFSSIKRKSGLENPTPEKRIRTESGSNSNGPPPSPWEAKRLKIDLIAAKAQITKLEARVNHQHTIRKEMQILFEEEKASLIEQHKHDERALSDMEDRLKLIRRRELEIKEEYNTATKEHKEAKLKWEKDRSDLQKQIAELKDKLLESDVSSKDQLSEMKRDMDELLQALEGAQQEVQILKKEVSKQTARAEQCSTLRSQLEKQTFELQQITNKLKELEYEKDSYKDWQQQVKTAQKRLTNMAELEKEVGRLRAAERSLRDSVSNKLLLEEQVHILNTKVETLQPVQQELHDAKVKIASLESTLEEWKNVARTHGIENARSLSSALDSALSSQLTAVAGCSQAQSQSAQLSEEIATVKFERDKATTKLNDLQTVRKNQESLIHRLQKRLLLVTRERDSYRQQLDCYEKELTVSISGAGESAGGAALLAARVEQLEKALQSYRDLLALHDHDPNAKALELARAETSKYREEAEAAKREVAKLRAQRDQLQMHFEKLAAPTKILHLADNPAAAAQKQMQLELESAQEEIKKLKAALRDGGSSAGDDALRAQLENSRIKLQRMKEEFTSSAQEYRDVCYMLLGYKIDRTGHKNYRISNMYAESSDEYLTFTLSDDGIEMVHTDYSTTLGDLVELHLHQNRSIPVFLSALTMELFTRITMQQTQC-