Monarch geneset OGS2.0

DPOGS205304
TranscriptDPOGS205304-TA2919 bp
ProteinDPOGS205304-PA972 aa
Genomic positionDPSCF300021 + 1377205-1380780
RNAseq coverage290x (Rank: top 38%)
Annotation
HeliconiusHMEL0161990.070.42% 
BombyxBGIBMGA011044-TA0.066.27% 
DrosophilaMms19-PB7e-6938.46% 
EBI UniRef50UniRef50_UPI0002063ECF5e-11532.14%UPI0002063ECF related cluster n=3 Tax=unknown RepID=UPI0002063ECF
NCBI RefSeqXP_001656411.12e-9728.60%DNA repair/transcription protein met18/mms19 [Aedes aegypti]
NCBI nr blastpgi|3838662451e-12031.83%PREDICTED: MMS19 nucleotide excision repair protein homolog [Megachile rotundata]
NCBI nr blastxgi|3800246383e-12832.85%PREDICTED: LOW QUALITY PROTEIN: MMS19 nucleotide excision repair protein homolog [Apis florea]
Group
Gene OntologyGO:00054883.1e-26binding
KEGG pathway 
InterPro domain[248-945] IPR0160243.1e-26Armadillo-type fold
[848-961] IPR0119892e-06Armadillo-like helical
Orthology groupMCL15176 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205304-TA
ATGGAGGCTATTTGGTATAAATCGGAGCTAGCCGATGAAATAATTAAAAATAATGATATATTTAATCAAACCCAAAGCATAATAGCAGATGTCGTTTCTGGAAAATTTGATATCACTGTCCTGGTAGAAAATATGGCTGGTGTGCTAACCAGCAAAGAAGTAGAAGACAGAGAACGAGGCATGAGGTTTTTCACAAAAATCTTAAAGGAAATTCCTGCTGATTATTTGACAGAGATGCAAATTAAATTCATATCAAAATTCTATGTAGATCGTCTTAAAGATAACCACAGAGTGATACCGGCAGTTTTAGAAGGCTATTTGTCTGTTATTGATATGAAACATTATAACGTGCAAAGTAGTGGTGAATATCTCACTGCTTTATTCCGGGAGGTGGCATGCCAGTCACAAGTAAGGCAAGATAGGTACAACATATACCTAACTATTCAAAAATTGTTTGCCAAGAATGTAGAATACATGAAAACATTAGGCCCGGATTTCGTATATGGATTCATATCGTCTATGGATGGAGAGAGGGATCCACGAAACTTGCTTTTCTTGTTTAATTTCTTACCGAACTTCCTAAGTAACATTCCATTGGGACATTTAGTTGAAGAAATGTTTGAAGTTATATCATGTTATTATCCCATCGACTTTCATCCGTCTCCTGACGACCCAGCAGCTGTGTCGAGAAACGATCTAGCTGCTGTATTATGTCCTTGTTTATGCGCGATTCCAGAGTTTGCTGAACATTGTCTTGTATTATTAATAGAGAAACTGGATTCTAACCTTCGAGTAGCTAAAATTGATTCTCTTATGCTATTGGCAGCTAGTTGCAAGACCTTTAAATATGAAAGTTATGGACCATTTTTGAAAGCATTGTGGTCTTCACTTCAAAGAGAACTTACACATAAAACTGACGACGAGTTAAAGTTAGCTGCTCATGAAGCTCTATCAGCTTTAGTATCAAAATTGTCCACAAAAGCAGAAACCGATCAATCATTCGAGAACTTCATTAAAGGAATACTTATATCCATGCAAAGTGCTATAGCCGAGTCTTCAACTGTTATCCAGTTTATGGAAGCAGTGAAAATATTATTAACGGCCGCCAATGCTTCAAAACAATCATGTGTTCTGATAATAAAAGCGATGATACCAGCCATTTTAGCATACTATGAATTCAAACCTTTGGTGAAATTACAAATTTCTTGTTTAGATGTTTTAGGTGATATGTATGAATTGGCTGATCATTGGGGGGTTTTGGATGAAATGGAAAAAGAGGTAAATGAAATACCACAGTTATGCCTTACAGCTGTCAGTGAGAGGGTGAAAGACTATCAAGTATCTGGCTTTAAAACTCTAATAAGAGTTAAAAATGTTCTTCACATAGATTTGGTGCTACCATTTGTGGAAATTCTGATCTATAACATTCAACATTCTCAAGATAGCGAAATATTGAGTGTTTGTGTTGAAACAGTACATGCTATAGCAAGGAAATATCCAGAGCTTATAATGACTTTGGTCATAAAAGGGAAGTGTGACTTGGAAAACCTAACACAGGACAAAACAGCGCTACAAAAAAGACTAGATCTGCTCTCAAATCTTGCCAGTATAGATGATTTTACCAAAATTATTATAGAGGAAATGTTGAAAGTAATAACAACAAATGATGAGGAAGCTTCTAAAGTTGTCAAAGCACTCAGTGGATCTATATCCAATGTAAGTTTGTATACAGAAGAAAAAGTCGCACAGATAGAAAGTGATCATGGCCTTATAAGTTCCATTATGGCCTGGTTAACGAAATCAATTTTGGATGAATCTCATGAATCGTTGAATCACGGCTGTACATTGATATCGAACACAATATGCAGTTTGCCGCCTGAAAAGCAAGCTAATATTTTATCCAAACATTCAAAGGCTATTTTAGAGAAGTGTGATTCAAATGAGATGTACTTCTTAATATTAGAATGTTTGTATCGTTCTATAAGTCCAACCATCTATGACACAAATTTCAAGGACATTATGGGTTTGGCCTTAAAACTAGCTTTAAACTGTGAGAACCAGTTACTTAGAACGAAAGCTTGTTGTATGGTAGCCCATTTTCTTAATAAGGCTCAGAGTGGTCCAAATTTTGAGATCTTAAATGAGGTTTTGAAATCTTATTTAACATCATGTAGCAGAGATAATGTTAATATATTACCAAGACTAATAGAATTGTATGGCTGGATAACAAAGGCGCTTATTATGAGAGGCAATGACCTCTTCCAATTTTGGCTTTCCAAGATATTGATTTCTATTTCAACAAGCGAATGCAGTGTTGAGGCATCAGAAGCTATCAAAATAATAATGACGGATTCTGAGAATTGTCTTAACGCTAGACATCATTGTAGGACAAGTTTGTTATACAGGCAGAGGATGTTCCAGACATTCGTCAATTTGACTGAAAAACTTGGACCACCAAATTCTGATTCCGAAGAGGCCTTTTACTTAAGTTGGGGTTATGTTTTAGAGAAAACGCCGAAAAGCATACTTAATAGTCAAATAAATAAGGTTACACCTTTGGTTATAGATGCTTTAGTGTATGACAATAAAGAATTGTTGAAGGTGATGTTAGAAGTCCTAATACATTTTGTGCAATCAAAAAACATAACAGTGGGACACAGTTTACAAACAATTTTACCCAGGCTAATAAATTTAACTACATATGTTAAATGTATGGATGTCAGAATAAAAAGTCTGCAATGTCTGTACGAAATCGCAAATTCTTACCAGACAAGATTGCTTTTACCTCATAAGCAGGATATTTTAATCGATTTAGCGCCATCTCTTGACGATAAGAAGCGACTTGTGAGGAATATGGCGGTTAAGGCCCGAACAAGATGGTACTTAGTTGGAGCTCCAGGCGAAAGTAAAGAAGATTAA

Protein sequence:

>DPOGS205304-PA
MEAIWYKSELADEIIKNNDIFNQTQSIIADVVSGKFDITVLVENMAGVLTSKEVEDRERGMRFFTKILKEIPADYLTEMQIKFISKFYVDRLKDNHRVIPAVLEGYLSVIDMKHYNVQSSGEYLTALFREVACQSQVRQDRYNIYLTIQKLFAKNVEYMKTLGPDFVYGFISSMDGERDPRNLLFLFNFLPNFLSNIPLGHLVEEMFEVISCYYPIDFHPSPDDPAAVSRNDLAAVLCPCLCAIPEFAEHCLVLLIEKLDSNLRVAKIDSLMLLAASCKTFKYESYGPFLKALWSSLQRELTHKTDDELKLAAHEALSALVSKLSTKAETDQSFENFIKGILISMQSAIAESSTVIQFMEAVKILLTAANASKQSCVLIIKAMIPAILAYYEFKPLVKLQISCLDVLGDMYELADHWGVLDEMEKEVNEIPQLCLTAVSERVKDYQVSGFKTLIRVKNVLHIDLVLPFVEILIYNIQHSQDSEILSVCVETVHAIARKYPELIMTLVIKGKCDLENLTQDKTALQKRLDLLSNLASIDDFTKIIIEEMLKVITTNDEEASKVVKALSGSISNVSLYTEEKVAQIESDHGLISSIMAWLTKSILDESHESLNHGCTLISNTICSLPPEKQANILSKHSKAILEKCDSNEMYFLILECLYRSISPTIYDTNFKDIMGLALKLALNCENQLLRTKACCMVAHFLNKAQSGPNFEILNEVLKSYLTSCSRDNVNILPRLIELYGWITKALIMRGNDLFQFWLSKILISISTSECSVEASEAIKIIMTDSENCLNARHHCRTSLLYRQRMFQTFVNLTEKLGPPNSDSEEAFYLSWGYVLEKTPKSILNSQINKVTPLVIDALVYDNKELLKVMLEVLIHFVQSKNITVGHSLQTILPRLINLTTYVKCMDVRIKSLQCLYEIANSYQTRLLLPHKQDILIDLAPSLDDKKRLVRNMAVKARTRWYLVGAPGESKED-