Monarch geneset OGS2.0

DPOGS215199
Transcript: DPOGS215199-TA (2820 bp)
Protein: DPOGS215199-PA (939 aa)
Genomic position: DPSCF300143 - 138778-148654
RNAseq coverage: 244x (Rank: top 42%)
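
The lengths and coordinates in this record can be cross-checked with a little arithmetic. The sketch below uses only the numbers listed in the record; it assumes the transcript is coding sequence only with a single terminal stop codon, and that the genomic coordinates are 1-based and inclusive. Neither assumption is stated in the record itself.

```python
# Values copied from the record for DPOGS215199.
transcript_bp = 2820
protein_aa = 939
start, end = 138778, 148654  # positions on scaffold DPSCF300143

# Assumption: the transcript is CDS only and ends in one stop codon,
# so it should hold exactly (protein length + 1) codons.
assert transcript_bp % 3 == 0
codons = transcript_bp // 3
assert codons == protein_aa + 1

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1
non_transcript_bp = genomic_span - transcript_bp

print(codons, genomic_span, non_transcript_bp)  # 940 9877 7057
```

Under these assumptions the gene spans 9877 bp of scaffold, of which 2820 bp appear in the transcript; the remainder would be consistent with intronic sequence.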
Annotation
Source          Hit                     E-value  Identity  Description
Heliconius      HMEL003868              1e-134   47.56%
Bombyx          BGIBMGA008679-TA        9e-90    49.18%
Drosophila      Ndc80-PA                8e-08    24.88%
EBI UniRef50    UniRef50_UPI00022467FA  4e-14    26.19%    UPI00022467FA related cluster n=1 Tax=unknown RepID=UPI00022467FA
NCBI RefSeq     XP_001946571.1          3e-11    28.63%    PREDICTED: similar to kinetochore associated 2-like [Acyrthosiphon pisum]
NCBI nr blastp  gi|345484055            1e-13    26.19%    PREDICTED: kinetochore protein NDC80 homolog [Nasonia vitripennis]
NCBI nr blastx  gi|386380972            2e-20    30.24%    acyl transferase [Streptomyces tsukubaensis NRRL18488]
Group
KEGG pathway: (none listed)
InterPro domain: [302-421] IPR005550, 3.1e-08, Kinetochore protein Ndc80
Orthology group: MCL25528, Lepidoptera specific

Nucleotide sequence:

>DPOGS215199-TA
ATGTGGAAGAGTGGTAATTGGGTGAAATATGAAACTAAAGCTTTCCCTTTGGTATTAAGACTTCACTATGATCAGCTTCTTATTACAATGGAGGAAAAAGTTATAGAATCACTTAATATAAATAGCAAACTAAAAGGGATTTTCAAAGATGACGTACTTATATTAATTATTAGCGTTATTGGACAGGAGGTCAGAAAAATTAAATTTCAAATTAAAGAAAATGTTAACCAGTGCTTACAAAAATTAGGTTTAAGATTCCCTGTGTTAAAATTTGGTTCAGAGCCTTCTAGAAAAAAATTTAAAGGAAAATATCGTAATGTAGAGGACATATTAAAGAAAACATTTGAAGTAAAAGATTCATTGCATACATCCATAACGCCGGCTGAATATAAAAATTTCGTAAAATTGTCTCTTCTAGATCCATTATTTCCGCAACTTGTGATTGAAACCGAAGACATAGTAAATGAATATTTAGAAAAGACTGATAGAATTGAGTTGTTTTTAGCTTTAATCATGATGTCTAATAGATATACGTTGGGACAATCTTCAATACGGAAATCTAGATTAGAAGGCAAGCCTTCTCTTCTTCCTAAGCCTCGTCGGCCGGGTTCAAGTGATCGATTATCAACGGAGACTCGTCGTCCTTCTGCAGCTGGACACAGATCTAGCAGCGCTGATCCACCTAGAGGAACATTCGGCCCTCGGTTGAGCAGAGAGGCTTCTGCTACGAAGCTACCTGTGAATGGGAGGTCTAGATCACAAACAGGTGACGCTAAGTATGGTGCGGCCACCTCAACTCCATTGCGAGTAAGCCACTACAGGTCTACTACTACTCCACAGAGAACCCCTTCAGAGGATCGGCTGAATCGTGATTGGAAGACCTGTCTTGAAAGAGCATTAGCCTTTGTGACCATCAAAGATCAGAGGCCAATATCTAATGTGGCCTGGCAGCGTTCTGAGTGTGCGAGGGTGGGTGAGGCTCTGGCCAAGCGAGAGTCTTCGATGGCTCTGATCCGTCCGCTGACTATCACACGCTTTGTGGACACCGTCGGGGCTCTACTGACTGCTATCACCAAGGATGCTAAACTTAATAATGACAATTACGTTACTAAACTGCCTCACCTGTCTAAGCGTGTGTTGTACCCCGGGCCAGTCTCCAAGTCCTGGCTACGTACAGTCAACACTCTTCACGCCTTCCCTCATGCGCTGGCTCTGGTGGCCTACCTCCTGGACCTGGTGACACACATAGAGAGTCCGGTGGAGGACGACTGGCTTTACATCAGTAAGGACGACTTGAGCTGTCTGCGGAGAGACTACTTGTATAAGTGCTGGATAAGGCCAATATCTAATGTGGCCTGGCAGCGTTCTGAGTGTGCGAGGGTGGGTGAGGCTCTGGCCAAGCGAGAGTCGTCGATGGCTCTGATCCGTCCGCTGACTATCACACGCTTTGTGGACACCGTCGGGGCTCTACTGACTGCTATCACCAAGGATGCTAAACTTAATAATGACAATTACGTTACTAAACTGCCTCACCTGTCTAAGCGTGTGTTGTACCCCGGGCCAGTCTCCAAGTCCTGGCTACGTACAGTCAACACTCTTCACGCCTTCCCTCATGCGCTGGCTCTGGTGGCCTACCTCCTGGACCTGGTGACACACATAGAGAGTCCGGTGGAGGACGACTGGCTTTACATCAGTAAGGACGACTTGAGCTGTCTGCGGAGAGACTACTTGTATAAGTGCTGGATAAGATTCCAACACCCAGGACACCAGTTCGAGGATCTGAATGAAGAGTATCTGGTGAACTTGAAGTCGTTGCTGGGGAACGACGAGGAGAAGATCAAGGAGTTGGAGCAAGTTATACTGAAATACTCTGCGTGTCTGAATGACGAGGCGGAGGCGGCCGCTCGGGCTGAAGTGGCGCGGCGCGAGGAGAGGGACACGCGGGCCGCCAGTGAAGCCCTACGGGCAGAGGCCAGGGACGCGGACCAGGAGGCTAAGAT
GATAGACGCTGACGTCGACAGAATCAGTGCGGAGCTGGAGCGCGTGACGTCAGACCTGGAGTCGCAGGTGATGTCTCGTGAGCAGCGCGCCAGACTGCTGGACGACTTGGACTACGCCGTCAGGGTGCACGACTCCAAGAGGACGCTGGCCGACGAGATACAGAAGATGGTGTCAAGCAAAGAGACGGAGCTGGCTCTGTGGCAGAAGAAGACCCTGGAGAGCTGCGGGGAGTACCGCCAGAGGCTCATACACCTGGCGGGGCTGCCGGCGCTCAACGCACTCGCCATCGACGAGAATTCGTTGATGGGTGCCGAGTGTGTGTCGCTGGTGACCCTGGCCGTGGACACGCTCCGTGAGGAGGCCAGCCGTCTGTCCGCTAAGAAGAATGAACTCCTGAGGACCAGGAGCGCTCTGGCCAGGAAGAGAACTGCTATGATGGAAGAGGCGCGGTCGAAGATCGGCGAGGTGGCCGCGACCGTGGAGAGAGAACAGAAGAGTCTAGAGGGAGAGAGAGAGAGGGAGGCAGAGGAGGTGAGGAAGTGGACCAAGCACGAGGGAGAGACGCGCACCAGGCTACAGGAGCTGGAGCACAGGATACAGCAGTACAGGAGCGTGGCCGACCACCTCGCCTACTGGGAGGACAGGGACGGACTGTGGAGATCCAAACTGTCGGAGTTGAAGGAATATATAGAGCAGCAGAAGATTGTGATGCAGAAGAGGCTGGAGCAGGGGAGGGCGAGGAGGGCGCAGCTGGTCGAGGGTACCCTCCGCCTGTGGAGGGAGAAGCTGGGCGGGAACAGCGAAATACACAGGGAGTGA

Protein sequence:

>DPOGS215199-PA
MWKSGNWVKYETKAFPLVLRLHYDQLLITMEEKVIESLNINSKLKGIFKDDVLILIISVIGQEVRKIKFQIKENVNQCLQKLGLRFPVLKFGSEPSRKKFKGKYRNVEDILKKTFEVKDSLHTSITPAEYKNFVKLSLLDPLFPQLVIETEDIVNEYLEKTDRIELFLALIMMSNRYTLGQSSIRKSRLEGKPSLLPKPRRPGSSDRLSTETRRPSAAGHRSSSADPPRGTFGPRLSREASATKLPVNGRSRSQTGDAKYGAATSTPLRVSHYRSTTTPQRTPSEDRLNRDWKTCLERALAFVTIKDQRPISNVAWQRSECARVGEALAKRESSMALIRPLTITRFVDTVGALLTAITKDAKLNNDNYVTKLPHLSKRVLYPGPVSKSWLRTVNTLHAFPHALALVAYLLDLVTHIESPVEDDWLYISKDDLSCLRRDYLYKCWIRPISNVAWQRSECARVGEALAKRESSMALIRPLTITRFVDTVGALLTAITKDAKLNNDNYVTKLPHLSKRVLYPGPVSKSWLRTVNTLHAFPHALALVAYLLDLVTHIESPVEDDWLYISKDDLSCLRRDYLYKCWIRFQHPGHQFEDLNEEYLVNLKSLLGNDEEKIKELEQVILKYSACLNDEAEAAARAEVARREERDTRAASEALRAEARDADQEAKMIDADVDRISAELERVTSDLESQVMSREQRARLLDDLDYAVRVHDSKRTLADEIQKMVSSKETELALWQKKTLESCGEYRQRLIHLAGLPALNALAIDENSLMGAECVSLVTLAVDTLREEASRLSAKKNELLRTRSALARKRTAMMEEARSKIGEVAATVEREQKSLEGEREREAEEVRKWTKHEGETRTRLQELEHRIQQYRSVADHLAYWEDRDGLWRSKLSELKEYIEQQKIVMQKRLEQGRARRAQLVEGTLRLWREKLGGNSEIHRE-
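
The relationship between the two sequences above can be spot-checked by translating the ends of the nucleotide record. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` and the two fragment variables are illustrative only and are not part of the record. The fragments are the first 30 and last 24 nucleotides of DPOGS215199-TA.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(nt: str) -> str:
    """Translate a nucleotide string in frame 1; '*' marks a stop codon."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# Fragments copied from the DPOGS215199-TA sequence.
cds_start = "ATGTGGAAGAGTGGTAATTGGGTGAAATAT"  # first 30 nt
cds_end = "AACAGCGAAATACACAGGGAGTGA"          # last 24 nt

print(translate(cds_start))  # MWKSGNWVKY
print(translate(cds_end))    # NSEIHRE*
```

Both results agree with the ends of DPOGS215199-PA. The trailing "-" in the protein record appears to stand for the stop codon that this sketch prints as "*".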