Monarch geneset OGS2.0

DPOGS202562
TranscriptDPOGS202562-TA1203 bp
ProteinDPOGS202562-PA400 aa
Genomic positionDPSCF300355 - 148213-150012
RNAseq coverage152x (Rank: top 53%)
Annotation
HeliconiusHMEL0131982e-17369.60% 
BombyxBGIBMGA004339-TA2e-14769.05% 
DrosophilaCG11788-PA2e-8037.62% 
EBI UniRef50UniRef50_D6WSW43e-8847.49%Putative uncharacterized protein n=3 Tax=Endopterygota RepID=D6WSW4_TRICA
NCBI RefSeqXP_968072.12e-10148.47%PREDICTED: similar to CG11788 CG11788-PA [Tribolium castaneum]
NCBI nr blastpgi|910861155e-10048.47%PREDICTED: similar to CG11788 CG11788-PA [Tribolium castaneum]
NCBI nr blastxgi|910861152e-9848.47%PREDICTED: similar to CG11788 CG11788-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[51-366] IPR0191286.2e-82Sister chromatid cohesion protein DCC1
Orthology groupMCL13466 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202562-TA
ATGGAATTTACAAGGGTTGAATTCGAAACAGCAGAGAGAACTCCTGATGAAGTAAGGAAAGTTATAAAAACGGCAAAACTTCACGAGTCTGAATTAACTGACATCACCCAAGTATTGAGATTTGCGGAACAAAATGTTCATAATTTAAATTTAAAACTGATGCTTCTAGATGATAATTTACTCCAAGAAATAGAAGCTGGAAACCAGTTAATATTTAAAGGTGATGTTGATGAAAATGCTGTTTTGTGTACACAAAGCCGAACTTACGATATTAAGGAAGCTGAAACATCTAACTCCCTCCATCTCGTTCCTGACCTGCTGTTTGCGGCTTCAACCGACGACGGGAGGCCTCCAAGGAGTATCGTGCACAAGGATATACTGAATACATTTTTTACGTATTACGAAGTAAAGCCGTGCAAGCCGCGGCTGCTTAAGCTACAGAAACTACTAGAGGAAACTTCATATCGCGGCTTGGAACTAGAGTATGAGGTGGACAAAACCAAGTTACTGACCTATGAAGGTATATTTGATGTCATCCAAGCTTCTCGAGCTGAGTTAGATGAGGAGTTGGTGAGGCTGCAAGCTTTGAAAATAGGAGAGCACTATCGATTACTAGATTTCGATTACGAGTTTAGAATTTTATCATACATGCTGGACCTGATAGAAGAGAATTCATGGCCATTAAATAAAATATCGAGAGAGGTCACTTTAGATAGCTTGAAAGATTTAGTTCCTGCTTGTATTTTGGAAGCGATGTTTGGATTTTATACGATGGAATCAGTAGAAGAAGGAGGGACGCAATACTATCAGTACAAAGAAGACAAAGTTTGCCGGTTTCTGGCGCGAGTCCTTTTGAAAAGCGCGGGAAAGTTTAATTTAGTTGAATTCATGCAAGCGTGGCGGGATTCCGTACCCGAAGGAATGATAACACATAAATCCATGCTGGCCGGGATAGCGATAACGGACGAATCCTCACAACCACCAGTTATATGGGGTTTCTCTGCGAGTGATTTGCCCGAGGACTTGAATCAGCGCTTTAAGATTCTGTTCCAAGCGAAACCCAAATGGACCCTTTCCGAAATATCGCCGTATATTGAGTTATACGCCACTGAGAAATTAAATGTAAACGCCCTATTAACCAAGTACGCACGCGCTTCAGCTCAAGATGGCGTCAGAGTTTTTTCTGCTAAACATATGAAATAA

Protein sequence:

>DPOGS202562-PA
MEFTRVEFETAERTPDEVRKVIKTAKLHESELTDITQVLRFAEQNVHNLNLKLMLLDDNLLQEIEAGNQLIFKGDVDENAVLCTQSRTYDIKEAETSNSLHLVPDLLFAASTDDGRPPRSIVHKDILNTFFTYYEVKPCKPRLLKLQKLLEETSYRGLELEYEVDKTKLLTYEGIFDVIQASRAELDEELVRLQALKIGEHYRLLDFDYEFRILSYMLDLIEENSWPLNKISREVTLDSLKDLVPACILEAMFGFYTMESVEEGGTQYYQYKEDKVCRFLARVLLKSAGKFNLVEFMQAWRDSVPEGMITHKSMLAGIAITDESSQPPVIWGFSASDLPEDLNQRFKILFQAKPKWTLSEISPYIELYATEKLNVNALLTKYARASAQDGVRVFSAKHMK-