Monarch geneset OGS2.0

DPOGS214699
Transcript: DPOGS214699-TA, 3645 bp
Protein: DPOGS214699-PA, 1214 aa
Genomic position: DPSCF300022 - 1303766-1312960
RNAseq coverage: 584x (Rank: top 22%)
Annotation
Heliconius: HMEL011773, 0.0, 69.50%
Bombyx: BGIBMGA004765-TA, 0.0, 67.20%
Drosophila: SMC2-PA, 0.0, 44.10%
EBI UniRef50: UniRef50_P50533, 0.0, 46.53%, Structural maintenance of chromosomes protein 2 n=38 Tax=Coelomata RepID=SMC2_XENLA
NCBI RefSeq: XP_001851491.1, 0.0, 48.30%, structural maintenance of chromosomes smc2 [Culex quinquefasciatus]
NCBI nr blastp: gi|307173964, 0.0, 47.11%, Structural maintenance of chromosomes protein 2 [Camponotus floridanus]
NCBI nr blastx: gi|170048005, 0.0, 48.09%, structural maintenance of chromosomes smc2 [Culex quinquefasciatus]
Group
Gene Ontology: GO:0005524, 7.3e-67, ATP binding
    GO:0005694, 7.3e-67, chromosome
    GO:0005515, 4.3e-45, protein binding
    GO:0051276, 4.3e-45, chromosome organization
KEGG pathway: cqu:CpipJ_CPIJ010133, 0.0
    K06674 (SMC2) maps-> Cell cycle - yeast
InterPro domain: [2-1158] IPR003395, 7.3e-67, RecF/RecN/SMC
    [474-686] IPR010935, 4.3e-45, SMCs flexible hinge
Orthology group: MCL13490, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214699-TA
ATGTATATAAAGTCTATCACACTCGATGGCTTTAAGTCCTACGGGAATCGAGTGGAAGTGAATGGATTTGATCCCGAGTTTAATGCTATCACAGGACTAAATGGGACCGGAAAGTCTAATATCCTAGATTCAATATGTTTTGTTTTAGGCATTACCAATCTTTCTAATGTGAGGGCTGGAAGTCTTCAAGAACTGATCTATAAGCATGGTCAGGCCGGAATCACCAAGGCCACCGTTAGCATCACCTTTGACAATCGTGACAAGAGGCAGTGTCCCATCGGTTATGAGAACCACGATGAGATAACTGTTACAAGACAGGTGGTGATGGGAGGTAAGAACAAGTACTTGATTAATGGCATCAATGTTCAGAACAAGCGGGTCAGTGACCTCTTCTGTTCCGTGCAGCTCAATGTGAACAATCCTCACTTCCTCATCATGCAGGGTCGGATCACTAAAGTGCTCAACATGAAGCCTCCTGAAATATTGTCAATGGTGGAGGAAGCAGCCGGGACGAGAATGTACGAAGCCAAGAAACAAGCTGCACAGAAAACTATAGAAAAAAAAGACGCCAAGCTCAGAGAATTGAATGATATAATTAGGGAGGACATCGCCCCCAAGCTGCAGAAGCTGCAAGACGAACGCTCGCAGTTCCAGGAATACCAGAAAGTGGTCCGCGAGCTGGAGAATCTTACCAGGCTTTATGTGGCTTGGAAATACGTATCCGCGGAGGAGAGCGCCAAGGAAGCCGCGAACAAGGTCACGGAGGTCCAGGACGAGATCAAAGACAAGAAAGAAATGATTTTGAACAATGAAAAAGAAGCCAAAGAGTACGACAAGAAAATAGCTGCACTCAATAAGAAGTTAGATGAGGAGAGCGGCAGCGTGCTCAGGGAGTTGGAGACGGAGCTGCAGGCGGCGGAGAAGACGGAGGCGAAGAGCGCGGCGGACTGGCGCGCGGCCCAGGGCGCCCTCACCACGCATCACGGCCGCGCCAGGCTCGCCTCGCGCGCCTTGGACGACGACCGCGCCGCGCTCAGGGACAAGGAGGCGCAGCTGCATGAGGTGTCGAGTACGTTCGACCGTCTGCGGGAGGCGTGCACCACGAGCGAGGCCCGCTTGGCCGCGACGCAGGCCCGCTTCCTGGCCGTGAGCGCCGGCAACGAGGACGCCTCGGAGTCACTGCAGGACCAACTCATAGCGGCGAAGCAAAAAGCGTCGGAAGCGTCGACTCGAATATCACAGAGTCAGATGGAGAAGAAACATGCGGAAGATCGCCTCGTCACCCTCGAGAAGGAATTCAAGAGTAGCAGCGCGCAGTACCAGAGAGATATGGAGGGGATCGCCCGCCACAGGGCCGAGGTGGCACAGCTGGAGGCTCAGTTATCGTCGAGTACGTTCAGTGCTGATCGTCGGAGCGCCCTCCAAGATCAGCTGCGGGCGTTGCGGGCCGGGGGGCGGGAGAGGAGGGACCGCGCCGACCAGCTCGCAGCACGACTGCAGCGGTGTCACTTCAAGTACACGCCGCCCACCGCCAACTTCGACTCTAACAAAGTATCCGGTACCGTCTGCAGACTCATAGACGTGCGGGATCCGAAGTACTGCACGGCGCTAGAGACGGCGGCGGGGGGACGGCTGTACAACGTGGTGGTGGACACGGACGAGACCAGTAAGCTGCTGCTGCAGCGAGGACAGCTGCAGTCCCGGACCACCATCATCCCTCTCAACCGCATCTCCTCACAACCTCTGTCGCGGGAGACGGTCGCGTTGGCGCAGAAAATCGGCGGCGGCCCGTCCGAGGTTCAGTTGGCGCTCGACCTGATCGACTTCCCGCCCTCGCTCCGCCCAGCCATGTCGTGGGTGTTCGGCAACACGTTGGTGTGCAGTTCCCTGGAGGCGGCGCGCCGCGTGACCTTCGACCCTCGCGTCAGATGTCGCTCCGTGACCCTGGACGGGGACGTGTTCGACCCGGCGGGCACGCTGTCGGGGGGAGCGCGAGCGAG
GGGCGGCTCGCTGCTGCTGGAACTGAAGGACCTCAAACACCTGGAGCGGCAGCTCGCGGAGGAGGACGAGCTGCTCGCCACACTCACCCGCGACCTGGACTCCATGCAACACGCCGCTGACCAGCACGCCGCACTGCAGCAGAGGTTGGAGATGAGTCGCCACGCGCTGGCGGTGAGCGAGGAGCGCGCCGCCAGCACCGCCACCGCCCAACTGCACGCAGAGATACAGGCCTTGAAAGATAACGTCGCGCGGCTGACTGCGGCCGTGGAGGAGGACGGACGCACCCGGCAAGAGACGGCAGCCCGCGCCAGGGAGCTCGAGCTCAAGGTTAAGGATATCAAGGGACACAGAGAGAGGCGCTACAAGAAGGCCGAGGAGGAGTTGAAGCTCGCCAAGAAGGAGGCGGAACAACACACCGCCTCCTGGAGACAGAGGGAGCAGGAGCACGACACGCTGCGGCTCGAGCTGGACGAGCTGCGCCGCGCCGTGGACTGCGGACAACAGGCGCTGGAGCGAGCCCGGGACAGCGGACGAGAGCTGGAGGAAGCGCTGGAGCGAGCCCGGGAGATACATGACGCACACGCGAACGAAGTTAAGGAGATACAGACGAAGATCAAGATACAGAAAGCGGAGATAGCGAGTCGCAGCGGAGAGGTGGCCGCCCTGACGAAGGGGAGGGACGAGCTGCTCGGGCGGAACAGAGACCTGGAGCTGGACGTGAAGCAGCTGGAGTACCGCTGCAGGGAGCTGCAGCAGGAGGCGGCGGAGGGGGAGAGGAAGATCAAGTCCTTGGTGGTGGAGAACCCTTGGATCCCGTCAGAGCGTCAGTACTTCGGTCTGTCCGGAGGCGCCTTCGAGTTCGGGCGCGACGTGAGCTCTGGCGGAGCGCGCCTCGCCCAGCTCAGGGACAGGAAGGACCGGCTGGCGAGAGGACTCAACGCACGAGCTCACACCTTGCTGGGGAAGGAGGAGGAGCAGTACCAAGACGTGATGCGCAAAAAGAAAATAGTGGAAGCCGACCGCGCCAAGCTGGTGCAGGTGATGGCGGAGCTGGACGACAAGAAGCGGAGGACGCTGCTGACGGCCTGCGAACAGGTCAACAGGGACTTCGGATCCATCTTCAGCAGTCTGCTACCCGGCGCACAGGCGCGCCTCACCCCGCCGCCGGGACAGAACGTGTTGGACGGTCTGGAGGTGAAGATCGGGTTCAACAACACGTGGAAGGAGTCCCTGGGCGAGTTGTCCGGGGGTCAGCGGTCGCTGGTGGCGCTGTCCCTGGTGCTGGCGCTGCTGCTGTTCCGCCCCGCGCCTCTCTACATCCTGGACGAGGTGGACGCCGCCCTGGACCTGTCGCACACACAGAACATCGGCCGGATGCTCAAGGAGCACTTCACGCACTCGCAGTTCATCATAGTGTCTCTGAAGGACGGCATGTTCAACAACGCCAACGTGCTGTTCCGCACGAGGTTCGCCGACGGCATGTCCGCCGTCACCAGGACCGACAACAGGAGAGATGTTGAGGAACGGTCTCCGCGGACGGGACTCAGATGTCGAAGGATGTTCCTGCTTGTGAGCGGCTTCAATGAAATATCTCCACTGGTTCACGTACTCGACGTAAAAGTACAAGGCTATTTTAACATGTAG

Protein sequence:

>DPOGS214699-PA
MYIKSITLDGFKSYGNRVEVNGFDPEFNAITGLNGTGKSNILDSICFVLGITNLSNVRAGSLQELIYKHGQAGITKATVSITFDNRDKRQCPIGYENHDEITVTRQVVMGGKNKYLINGINVQNKRVSDLFCSVQLNVNNPHFLIMQGRITKVLNMKPPEILSMVEEAAGTRMYEAKKQAAQKTIEKKDAKLRELNDIIREDIAPKLQKLQDERSQFQEYQKVVRELENLTRLYVAWKYVSAEESAKEAANKVTEVQDEIKDKKEMILNNEKEAKEYDKKIAALNKKLDEESGSVLRELETELQAAEKTEAKSAADWRAAQGALTTHHGRARLASRALDDDRAALRDKEAQLHEVSSTFDRLREACTTSEARLAATQARFLAVSAGNEDASESLQDQLIAAKQKASEASTRISQSQMEKKHAEDRLVTLEKEFKSSSAQYQRDMEGIARHRAEVAQLEAQLSSSTFSADRRSALQDQLRALRAGGRERRDRADQLAARLQRCHFKYTPPTANFDSNKVSGTVCRLIDVRDPKYCTALETAAGGRLYNVVVDTDETSKLLLQRGQLQSRTTIIPLNRISSQPLSRETVALAQKIGGGPSEVQLALDLIDFPPSLRPAMSWVFGNTLVCSSLEAARRVTFDPRVRCRSVTLDGDVFDPAGTLSGGARARGGSLLLELKDLKHLERQLAEEDELLATLTRDLDSMQHAADQHAALQQRLEMSRHALAVSEERAASTATAQLHAEIQALKDNVARLTAAVEEDGRTRQETAARARELELKVKDIKGHRERRYKKAEEELKLAKKEAEQHTASWRQREQEHDTLRLELDELRRAVDCGQQALERARDSGRELEEALERAREIHDAHANEVKEIQTKIKIQKAEIASRSGEVAALTKGRDELLGRNRDLELDVKQLEYRCRELQQEAAEGERKIKSLVVENPWIPSERQYFGLSGGAFEFGRDVSSGGARLAQLRDRKDRLARGLNARAHTLLGKEEEQYQDVMRKKKIVEADRAKLVQVMAELDDKKRRTLLTACEQVNRDFGSIFSSLLPGAQARLTPPPGQNVLDGLEVKIGFNNTWKESLGELSGGQRSLVALSLVLALLLFRPAPLYILDEVDAALDLSHTQNIGRMLKEHFTHSQFIIVSLKDGMFNNANVLFRTRFADGMSAVTRTDNRRDVEERSPRTGLRCRRMFLLVSGFNEISPLVHVLDVKVQGYFNM-
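
The record lists a 3645 bp transcript and a 1214 aa protein. These figures fit a coding sequence of 1214 codons plus one stop codon, the stop being shown as the trailing "-" in the protein sequence. A minimal sketch of that check follows, assuming the transcript is coding sequence only (no UTRs). The helper names `expected_cds_length` and `read_fasta` are hypothetical and are not part of any database tooling.

```python
def expected_cds_length(protein_aa, include_stop=True):
    """Nucleotides needed to encode protein_aa residues, plus an optional stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


# 1214 residues + 1 stop codon = 1215 codons
print(expected_cds_length(1214))  # 3645
```

To apply the check to the sequences above, pass the FASTA text to `read_fasta` and compare `len()` of the DPOGS214699-TA entry with `expected_cds_length()` applied to the length of the DPOGS214699-PA entry, not counting its trailing "-".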