Monarch geneset OGS2.0

DPOGS205298
TranscriptDPOGS205298-TA3609 bp
ProteinDPOGS205298-PA1202 aa
Genomic positionDPSCF300021 + 973020-986244
RNAseq coverage545x (Rank: top 23%)
Annotation
HeliconiusHMEL0162190.075.47% 
BombyxBGIBMGA011039-TA0.094.86% 
DrosophilaCap-PA0.068.74% 
EBI UniRef50UniRef50_B4IF960.067.08%GM13388 n=3 Tax=Drosophila RepID=B4IF96_DROSE
NCBI RefSeqXP_001652359.10.073.59%structural maintenance of chromosomes smc3 [Aedes aegypti]
NCBI nr blastpgi|1571146590.073.59%structural maintenance of chromosomes smc3 [Aedes aegypti]
NCBI nr blastxgi|1571146590.073.65%structural maintenance of chromosomes smc3 [Aedes aegypti]
Group
Gene OntologyGO:00055242.7e-56ATP binding
GO:00056942.7e-56chromosome
GO:00055151.4e-41protein binding
GO:00512761.4e-41chromosome organization
KEGG pathwayaag:AaeL_AAEL0069370.0 
 K06669 (SMC3, CSPG6)maps-> Meiosis - yeast
    Cell cycle - yeast
    Cell cycle
    Oocyte meiosis
InterPro domain[2-1181] IPR0033952.7e-56RecF/RecN/SMC
[479-691] IPR0109351.4e-41SMCs flexible hinge
Orthology groupMCL12237 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205298-TA
ATGCATATTAAACAGGTGATTATTCAAGGGTTTAAGAGTTATCGTGAACAGATAGTGGTAGAACCCTTTGATAAACGGCACAATGTAGTTGTTGGACGTAATGGTTCCGGAAAAAGTAATTTCTTCCATGCTATCCAGTTTGTACTTAGCGATGAGTTCTCTCATCTCAGGCCGGAGCAACGTCTGGCGCTACTACATGAAGGCACTGGACCTCGGGTGATATCAGCTTTTGTTGAAATCATTTTTGACAACTCTGATAATCGAATTCCAATTGAAAAGGATGAGATTTTCCTCCGACGAGTCATTGGTTCTAAGAAAGATCAATTTTTTCTGAATAAGAAAGTTGTGCCAAGATCAGAAGTACTAAATCTACTGGAAAGCGCTGGTCTCTCCAACTCTAATCCATATTACATTGTGAAACAAGGAAAGATTAATCAAATGGCAGTAGCCCCTGATTCGCACAGATTGAAATTGTTAAGAGAAGTAGCTGGTACAAGGGTTTACGATGAGAGGAGAGAGGAATCTGTTACCATATTGAAAGAAACTGTCGGGAAGGTTGAAAAAATTAACGATTTCCTCCAAACCATTGAAGAGAGGCTGAAAACGCTTGAAGAGGAGAAAGAGGAATTGAAAGAATATCAGAAATGGGATCGCGCAAGACGTGTCCTTGAGTTTATAATACATGACACAGAACACAGAGAAAACAAACGGAAACTGGAAGATTTAGAGAAAATGCGTGCCAACAGTGGCAAAGAACAACAACATTATGCCGACATGGTGAGAGAGGCTCAAGATCATGTGAGAGAAGCCAACAGAAAACTAAAAGAAGCCAGAAAAGATGTGGCAGCAACTCGTGAAGAGAAGGACATACTTTCAACTGAACAACAGCAGTTATTGAGAGAAAAGACGAAGCTCGAGCTCGCCATCAAAGACTTGACAGACGATGTTGACGGAGATAACAAGTCTAAGGAAAGAGCGGAAGCTGAGCTGGAGCGTCTCCGTCAGCAGATTTCGGAGAAAGAACGCGAATTAGAGGAACTGAAGCCCAAGTACGAGGAGATGAAGGCGCGGGAGGAAGAATGTACGCGAGCGCTCGCCCTGAACCAACAGAAACGACAAGAGCTATACGCCAAGCAAGGCCGCGGCACACAGTTTACATCCAAACAAGACAGAGATCGATGGATAGAGAAAGAGCTCAAATCACTGAACAAACAAATAAAAGACAAGAAGGACCACGAGAACAAACTACGCGAGGATCTGAGGAGGGACGCGACAAAGCTGACGGAACTGGAGAAGAGAATCGAAGAAATGACTAAGGAAATGGAACGACAAAGGGTCGCCATCGACGAACACAACAAGCAGTACTACGAGTGTAAGAAGAAAAAGGATCAGGAACAAAGCACTAGGAACGAGTTATGGCGTAAGGAGACCACATTAACTCAGAATTTGTCTTCCTTAAAAGAAGATCTATCTAAAGCCGATCAAGCGCTTCGCTCCATGGCTGGAAAGCCCATACTAAACGGCCGAGACAGTGTTAGGAAAGTTCTAGAAACTTTCCAAGAACGTGGCGGAGAGTGGGCCAAGATAGCCACACAATACTATGGCCCCGTTATAGAAAACTTTAGCTGCGACAAAACTATATACACAGCTGTTGAGGTTACAGCCGGCAACAGACTGTTCCATCATATAGTGGAGTCGGACACCGTCGGTACTAAAATATTAAAGGAGATGAATCGTCAAAGCCTCCCCGGCGAGGTGACCTTCATGCCCCTCAATAGACTACAAGTCAGGGACATGGTTTATCCTAATGATAACAACGCCATAGCCATGGTTCAGAAATTGAAGTATGATCCGAAGTATGCAAAGGCTATGAAGTATATCTTCGGTAAGACGCTTATATGCAGGAACTTGGAGTGTGCCACCGAGCTCGGGAAACAGTTTCACCTGGACTGTGTCACTCTAGAGGGGGATCAGGTATCATCGAAAGGGTCACTCACTGGTGGATATTTTAATCAATCCCGATCCCGTCTGGAGATGCAGAAGACGAGATCGGAATTGATGGAGCAAATAACATCGCTGGACGAAGAGTTGAACACCTTGAGACAGGAACTCAATAAAACGGAGACCAGCATTAATACTATCGTATCTGAAATGCAAAGGACTGAAACGAAACAGGGGAAGGCTAAAGATATATTTGACAAAGTCAAAGCTGATATTCGTTTAATGAAGGAGGAGCTTGCGTCCATAGAACGATTCCGCGGACCTAAAGAAAGATCGTTGGCACAGTGTAGATCCAGTCTGGAAGCCATGCAGGCAACCAAGGAAGGGCTCGAGTCAGAATTACACCAGGAACTCATGGAGCAGCTATCTATAGCTGATCAAGGTAAGGTCGACGAACTGAACGACGCCATCAGACGTCTGACACAAGAGAACAAAGAGGCCTTCAGTCAGAGAATGAACCTTGAAGCCACCAAGAACAAACTAGAAAACCTGCTCACTAACAACCTTATACGTCGTAAGGACGAGCTAGTACAAGCGCTTCAAGAGATCTCAGTGGAAGATCGCAAACGTCGTTTGGCGACAAGTAAGGCGGATCTCACCGGCGCTGAGAAAAGAATCAAACAGATCAATAAGGACCTCGAAGATGTCGAGAGAAAGGTCCAAGCCGCGGTCAAGACGGAAAAGGCCCTCAAACTGGATCTAGACAAATGGAGGAATAAGGAGAAAGAAGCACAAGACAAAATGGAAGAAGATGCCAAGGGTCTGGAAAAGATGGCGTCCAAAGAAGTGTTATTACAAGAGAAAATACAAGAGTCATTGGACAAAATAGCGGCCCTAGGAACCCTGCCTAACGCTCCCGAACTACATTCTAAGTACCAGAAACTATCTTTAAAACAGCTATTCAAAGAACTGGAGAAGGCCAACCAACATCTCAAGAAATACAACCACGTGAATAAGAAAGCTTTGGATCAGTTTATAAGCTTTTCGGAACAGAAAGAAAAACTGTACAAGAGGAAGGAGGAACTCGATATTGGTGGCGAAAAGATCCGTGAGCTCATCGAGACGTTGGAACATAGGAAATTAGAAGCGATACAGTTCACGTTCAAGCAAGTCAGCAAGAATTTCACTGAGGTGTTCAAAAAGTTGGTCCCTCAAGGCAGAGGGAGTCTTATAATGAGAGTGGCGCCCGACGAGGGTCAAGATATACCGGATCGAGCTAACGCGGACCCGTTCACTGGCGTGGGTATCAAAGTATCGTTCACTGGAGGTGAAGGTGACATGAGAGAAATGAACCAATTGTCCGGAGGTCAGAAGTCACTGGTCGCCCTAGCACTCATCTTCGCCATACAGAAATGTGATCCAGCACCTTTCTACTTGTTCGACGAAATTGATCAGGCTCTAGACGCTCAGCATCGTAAGGCGATCGCGAACATGATCCACGAGCTGTCGTCGTCGGCTCAGTTCATAACGACAACTTTCCGTCCGGAGTTGTTGGAACACGCCCACAAGTTCTACGGAGTCAAGTTCCGGAACAAAGTGTCGCACGTGGAGTGCGTCACGCGCGACGAGGCCAGGGACTTCGTAGAGGACAGTGCTACGCACGCGTAG

Protein sequence:

>DPOGS205298-PA
MHIKQVIIQGFKSYREQIVVEPFDKRHNVVVGRNGSGKSNFFHAIQFVLSDEFSHLRPEQRLALLHEGTGPRVISAFVEIIFDNSDNRIPIEKDEIFLRRVIGSKKDQFFLNKKVVPRSEVLNLLESAGLSNSNPYYIVKQGKINQMAVAPDSHRLKLLREVAGTRVYDERREESVTILKETVGKVEKINDFLQTIEERLKTLEEEKEELKEYQKWDRARRVLEFIIHDTEHRENKRKLEDLEKMRANSGKEQQHYADMVREAQDHVREANRKLKEARKDVAATREEKDILSTEQQQLLREKTKLELAIKDLTDDVDGDNKSKERAEAELERLRQQISEKERELEELKPKYEEMKAREEECTRALALNQQKRQELYAKQGRGTQFTSKQDRDRWIEKELKSLNKQIKDKKDHENKLREDLRRDATKLTELEKRIEEMTKEMERQRVAIDEHNKQYYECKKKKDQEQSTRNELWRKETTLTQNLSSLKEDLSKADQALRSMAGKPILNGRDSVRKVLETFQERGGEWAKIATQYYGPVIENFSCDKTIYTAVEVTAGNRLFHHIVESDTVGTKILKEMNRQSLPGEVTFMPLNRLQVRDMVYPNDNNAIAMVQKLKYDPKYAKAMKYIFGKTLICRNLECATELGKQFHLDCVTLEGDQVSSKGSLTGGYFNQSRSRLEMQKTRSELMEQITSLDEELNTLRQELNKTETSINTIVSEMQRTETKQGKAKDIFDKVKADIRLMKEELASIERFRGPKERSLAQCRSSLEAMQATKEGLESELHQELMEQLSIADQGKVDELNDAIRRLTQENKEAFSQRMNLEATKNKLENLLTNNLIRRKDELVQALQEISVEDRKRRLATSKADLTGAEKRIKQINKDLEDVERKVQAAVKTEKALKLDLDKWRNKEKEAQDKMEEDAKGLEKMASKEVLLQEKIQESLDKIAALGTLPNAPELHSKYQKLSLKQLFKELEKANQHLKKYNHVNKKALDQFISFSEQKEKLYKRKEELDIGGEKIRELIETLEHRKLEAIQFTFKQVSKNFTEVFKKLVPQGRGSLIMRVAPDEGQDIPDRANADPFTGVGIKVSFTGGEGDMREMNQLSGGQKSLVALALIFAIQKCDPAPFYLFDEIDQALDAQHRKAIANMIHELSSSAQFITTTFRPELLEHAHKFYGVKFRNKVSHVECVTRDEARDFVEDSATHA-