Monarch geneset OGS2.0

DPOGS212297
Transcript: DPOGS212297-TA, 3642 bp
Protein: DPOGS212297-PA, 1213 aa
Genomic position: DPSCF300077 + 937915-946337
RNAseq coverage: 226x (Rank: top 44%)

Annotation
Heliconius: HMEL014954, E-value 0.0, identity 39.97%
Bombyx: BGIBMGA011454-TA, E-value 9e-144, identity 50.65%
Drosophila: CG7139-PA, E-value 6e-41, identity 26.50%
EBI UniRef50: UniRef50_D7EJV7, E-value 1e-60, identity 30.09%, Putative uncharacterized protein n=2 Tax=Coelomata RepID=D7EJV7_TRICA
NCBI RefSeq: XP_968941.1, E-value 2e-61, identity 30.09%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp: gi|91094617, E-value 5e-60, identity 30.09%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx: gi|91094617, E-value 4e-66, identity 30.38%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]

Group:
Gene Ontology: GO:0005524, E-value 9.4e-05, ATP binding
               GO:0051085, E-value 9.4e-05, chaperone mediated protein folding requiring cofactor
KEGG pathway:
InterPro domain: [1106-1192] IPR002625, E-value 1.8e-10, Smr protein/MutS2 C-terminal
                 [1039-1098] IPR013899, E-value 7.5e-06, Domain of unknown function DUF1771
Orthology group: MCL16560, Patchy

Nucleotide sequence:

>DPOGS212297-TA
ATGGAAGAAAAAGATAAAGAAACCGATTACAATAATGTTGTCGATAAACTATTGGAAACGTTCGGAGAGATTCTCAGTAGAGATGTGATGTTAGCTATCGTGGAAAGCTACGAAGGAGACCTTAATGAATCGGCGAATGCTGTTATGAACATCTCTATGGATGGGAATTGTAGGAATCCTTCAAATAGCGGAAATGTTCTAAGGCCAAACAACATGTATTCATTCCGTCCCAAGAACCCCTATGTCAACCATCCTGGCATAAGGCAGCCACATATGAGCCATTTTTACCCTCCAAATGGTTTGTTTGCACAGATTGGAAATCCACATAATGTTCCTCAGCGGCCAATGTCATATGCTGCTGCTTATCATCAAATGTCTCATCAAATGATTAACCAGCCAGTGAGACCTACAAGTAAATCTGACAATAAAAATCAAAGCATTAAACCTGTTCCCAAATCTGATCAACGGAATGATTTTACGAAAAACACTCCAAGTCCTGACCTTCAAAATATCTATAAACACCACGAGAATGGTCATCGTACTTTAATAATTTTGCGAGGTGCACCTGGTAGTGGAAAAACATATCTGGCGCGTAAAATTATTGATACGCTTTATAATAAACGTAATAATAATTACTATATGCACATTTTTAGTACTGATCAATATTTTACAAGGAAGGGTGTTTATGAATATTGCAGAAACAGACTATCTGAAGCCCATGAGTGGAATCAGAAACGTGCACATGGTGCTATGAAGCAAGGTGTAAGCCCTGTAATCATAGACAACACAAACACTGAAATATGGGAAATGAAACCATATGTAGACAGTGGAGTGAAGTTTGGATATATTATAGAAGTTCTTGAACCTGACACACCTTGGGCTAGAAAAGCTCAAACACTGTGCAAGAAAAATTCGCACAATGTTCCCTTTGCAAATATTAAACGCATGCTGGAAAACCTGGAACATAATCCAACAGGGCCATCCTTGCTACATCACTATTCTTTGCAATATGAACCAACTATGACACCGCCTATCTTGCGGTCACTACCGCCGTTTAATGAAAGATCGCCAGAAGAATTGGGTTCCACTAATAATAGGCCAAAATATGTTGCAAATGAAAACATTAAAAACGTCCGTACAAACAATTATAACCACCAAATAAAACAGCAATCACCGAATGCTAGCTCGTCACAGCAAAGTATTTCTGCCTCAGTAAATAATACAAAAACAAACAAATCTGCAGAAAATCCATTTTTAAATGCTTTACATGTGAACAATCCAGTAAACGATAATACAGATGATGTACAAAAAATCTTAGAAACATTTGAAAAAGTGGAGATTGAATGGGAAAGTGGTGAGCAATGGGAAGCAGAGTCAGCAAGAAGGGAAGAGAATTTAGAATGTCAGGGTGCAAAAGCATTAGATGCAAAACCGCAGAGAAAACATAAAGCAGTGAACATCGAAAATACCGATAATGCACTAAAAGGCAATTTGACAAATTGTCAAGATTGGACAAAAATAGGAATGTTCTTACCACCATGGTCTGATGAAAATGAAAACAAAAACGTACAAGATGTAGAGTCCTCGATCCCGCTCGTAGAAAAAAGATCCAACTTTACCTGCTTCGAAATCGGGGATACAAATATCAGTCATGCAAAAAATCTATATAAAATTGTCTCTGCTAAGCCACGAGACATCAATGAATTTCATCTCCCGATAACAAGCAATAAAATTCCTGATCAATGGATGTTAGATAAAAGTACTTCAACGCATGACAATCAGATGTTGTCACCAATAAAACGTTGTAAGAACGAGGAGGAACATTTTAAGTTGTTCTCGAGATTATTTAAAAACGTGGATGTTGATATATTGAGGGATATTTTCGACAAATGTTGTGGGGATATAAACTGGGCGGTAGATATCGTACTTGATGACATGGCTAGCGAAAACGTCAATAATATAATGGCCAACATAAACACACCAATTCATGACATAGATAT
GCCTGAACTTGATTGCGACTGTCTCTCCGCCTATAACATAATACCAGAACGACTTGAGGCAATCTTACCAGAAAACCAAATGCCTAACCAGACGGATACCCAGTATACAGTAAATTTCAAAAAGAATAAAAAAGAATGCACTGTCTCAGAGGCTTCTATCGAGTTGAAAAAACAAATAGAAGGAAATTTTGTCATACCTGACAATCATTACTCTGAACGTTGCTTAAAAATAAGAAAACTTCGTCGTGGTGAAGTTGATACTCAAGAAACATTTGATTCTGAAGACAATACTAACAACGAGCCTTCAACATCGGCTGAAACTTCAAGAACTACTGATCAAAATTCAGCATATTTTGACAATAATTTACCAAGCACATCTAAAACGGATAGTGTTGTACCAGAAATAAGTGACTCAGCTTCTTGTGTAAGCTCTGATGAGGATGAAGAAAAGACTGTAAACATATGTTTAGGAAGAGAATTTGTGACGAAATTAGATGAAATCTATGGAGGTCAAACAGCTGAATGTCTCAGTTTCATAAATCCGACAGTCAACATCCCAATGTCACTTTTAAGTATAATAAACGCTCTATGGATTGAATCATTAACCGATCAGTTGCAAGAAGATTCTAAACAATCTCAAATAATGATGGAGCAAGACGCGGAACTCGCGCGACTCTTAGCAGCTAAAGAGGAGGAGTTAAGTCGTTTGGGACAAGAACCAGAAGTACCAGATTTCAAGGAAATCATGGACATGGATTACGCCTTGTCTATATATCAGAGGGATGTATTGGAATGGCGTAATAATTTACCAGATGATCTCGCAGCAAAACTTTCAAGAGAAAAGTTGTATAATTTATTCCCATCGGTATCCCCGGACGTGCTATCAGAACTTCTGGCGGCCCATGACAATAATTTCCATCTAACCGTCGAGGTGCTCTTGACCTCTACTGGACAAACTGATATTCTGCAAGCGAAGAATGGCGTTAATAAATTTATTGTGCAAAAAGAAATGGAGCGGCATGAGAAACTGCTCCAAGAAGAGAGAAAAGCGCTGTCTGAAGTTGAGTGGCCATTGTTGCCGAAAAATGAGAAAGTGGATATGTCCACTGTACAACATTTTCGCAACTGCGCCGACAAGCACCTCAGAATTAGAAACGATAATTACAACAAGGCGAGTGAATACTTCAGGCGTGGTATGACTCAAGTGTCGACTTATTATTTAGAACTGGCGAATTTTCATAAGACAAGATTCGAACATTCCAATTCCATGGCAGTAGCTTCCTTGATACAAGTACACGCAGCCAACTCCTCCAATAACGCTACTCTAGACCTCCATTATCTGAGGGTCCGTGAGGCTAAGGAGGCATTAGACCTCTTCCTAGATACACATATACAAAAATTGAAAGAATTGCAAACCAGATCCAGTGTGAGATGTCACGATCTGTTCTTTATAACCGGCAGAGGAGCCCACAGCCAGGGCGAGCCTAAACTAAAACCAGCTGTTCAAAAGAGGCTATTGGAACGCGGATTGAATTTTATAATACACAATCCCGGCCTTTTAATAAGTAGCGTAAGGTCTGATAACAAATTGACGTGCGAGATATCTAGTGCCGGTCAAGATGGGCCCCCTCAAGACCCATGA

Protein sequence:

>DPOGS212297-PA
MEEKDKETDYNNVVDKLLETFGEILSRDVMLAIVESYEGDLNESANAVMNISMDGNCRNPSNSGNVLRPNNMYSFRPKNPYVNHPGIRQPHMSHFYPPNGLFAQIGNPHNVPQRPMSYAAAYHQMSHQMINQPVRPTSKSDNKNQSIKPVPKSDQRNDFTKNTPSPDLQNIYKHHENGHRTLIILRGAPGSGKTYLARKIIDTLYNKRNNNYYMHIFSTDQYFTRKGVYEYCRNRLSEAHEWNQKRAHGAMKQGVSPVIIDNTNTEIWEMKPYVDSGVKFGYIIEVLEPDTPWARKAQTLCKKNSHNVPFANIKRMLENLEHNPTGPSLLHHYSLQYEPTMTPPILRSLPPFNERSPEELGSTNNRPKYVANENIKNVRTNNYNHQIKQQSPNASSSQQSISASVNNTKTNKSAENPFLNALHVNNPVNDNTDDVQKILETFEKVEIEWESGEQWEAESARREENLECQGAKALDAKPQRKHKAVNIENTDNALKGNLTNCQDWTKIGMFLPPWSDENENKNVQDVESSIPLVEKRSNFTCFEIGDTNISHAKNLYKIVSAKPRDINEFHLPITSNKIPDQWMLDKSTSTHDNQMLSPIKRCKNEEEHFKLFSRLFKNVDVDILRDIFDKCCGDINWAVDIVLDDMASENVNNIMANINTPIHDIDMPELDCDCLSAYNIIPERLEAILPENQMPNQTDTQYTVNFKKNKKECTVSEASIELKKQIEGNFVIPDNHYSERCLKIRKLRRGEVDTQETFDSEDNTNNEPSTSAETSRTTDQNSAYFDNNLPSTSKTDSVVPEISDSASCVSSDEDEEKTVNICLGREFVTKLDEIYGGQTAECLSFINPTVNIPMSLLSIINALWIESLTDQLQEDSKQSQIMMEQDAELARLLAAKEEELSRLGQEPEVPDFKEIMDMDYALSIYQRDVLEWRNNLPDDLAAKLSREKLYNLFPSVSPDVLSELLAAHDNNFHLTVEVLLTSTGQTDILQAKNGVNKFIVQKEMERHEKLLQEERKALSEVEWPLLPKNEKVDMSTVQHFRNCADKHLRIRNDNYNKASEYFRRGMTQVSTYYLELANFHKTRFEHSNSMAVASLIQVHAANSSNNATLDLHYLRVREAKEALDLFLDTHIQKLKELQTRSSVRCHDLFFITGRGAHSQGEPKLKPAVQKRLLERGLNFIIHNPGLLISSVRSDNKLTCEISSAGQDGPPQDP-