Monarch geneset OGS2.0

DPOGS207117
TranscriptDPOGS207117-TA4176 bp
ProteinDPOGS207117-PA1391 aa
Genomic positionDPSCF300001 + 3344856-3355744
RNAseq coverage632x (Rank: top 20%)
Annotation
HeliconiusHMEL0096080.079.75% 
BombyxBGIBMGA013079-TA0.070.15% 
Drosophilasno-PD0.064.99% 
EBI UniRef50UniRef50_F4WMK30.066.38%Protein strawberry notch n=9 Tax=Arthropoda RepID=F4WMK3_ACREC
NCBI RefSeqXP_001807824.10.063.62%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|3071975220.065.69%Protein strawberry notch [Harpegnathos saltator]
NCBI nr blastxgi|3320244160.066.09%Protein strawberry notch [Acromyrmex echinatior]
Group
KEGG pathway 
Orthology groupMCL10521 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207117-TA
ATGTCAAAAGGTGAGCTCAGTTCAAAACGCAGCGCATTCCCCGCTCCTTCAGATGATGACTCTGATTTTGATGATGATGAGGACCCAGACAACTTGGAAGTGCCAGGTGGAGGTAAGACTTTGGCAGCGGCAGCGAGAATGGGCACTAAAGCAAAGGTGCTTCAACCTGTAACTGTGAAAGCTACAACTAACCCACTAGCTGCGCCCTCTGGTGTGGCGTTCGGTCAGGCTGCAAACATTAAGCCCATCAAGATATCAGCAAATTCTCTCAAGAAGCCCATCAGTTTACTGGGATTGGGAGCCGGTGCGAGCTCCTCTGTACCTGTGATGCAACATTCACGTAATCATATGAATGGTGCAGGAATGACTAGCATGAACTCGTTCTTGCTCCAGAACTTGAACGATATTCTAAATTCGAGCATAAACGGGGTATTGGGTGGGTCTCTAAGCGGAGTGTTGGGAGGCTCGTTGGGTGGTTCGATGAGTGGCTCGATGGGAGGCTCAATGGGAGGCTCGATGGGAGGCTCGATGGGAGGCTCGCTGGGAGGTTCGTTGGGAAGCTCATTAGGAAACTCGTTGGGTGTTCCCCCTTCCCTGGCCGGGATGTTGGTTCAGGTGGCGCAACTGCAAGGCGGTGACAAGGGTGGATGGAATCAGGTTAAGCTACCTGGCGAGGAGGAGGAGGTGGACGACGAGGAGATGGGAGTAGCGGAGACCTACGCCGACTACATGCCCACGAAACTGAAGCTGGGTCGAAAGCACCCGGACCCGGTGGTTGAGACGGCGTCTCTGTCGTCTGTGGAGCCGGTGGACGTGACTTACACACTCAGTCTGCCTGACGACACGATCCGCTCCGGCTTGTTGTCCGCTCTACAGTTGGAGGCCGTGGTCTACGCCAGCCAGGCCCACGAGCACACACTACCAGACGGCACCAGGGCCGGATTCCTCATCGGCGACGGCGCGGGAGTGGGGAAAGGTCGTACCATAGCTGGTATAATCTTTGAAAACTATCTGAAAGGTCGGAAACGTGCCGTGTGGATATCCGTATCAAACGACCTCAAGTACGACGCCGAAAGGGACCTTAGAGACATTGGTGCATCTAAGATCGAAGTACATCCTTTGAACAAGTTCAAGTACGCTAAGATCTCGTCAGCTATCAATGGTAATGTGAAGAAGGGCGTCGTGTTCAGTACGTATTCTGCTTTGATAGGGGAGACCCAGGCCAATACTAAATATCGCACCAGACTGAAGCAGCTGTTACAGTGGTGTGGCGAGGACTTTGATGGAGTCATTGTATTCGACGAATGTCATAAAGCTAAGAATCTGTGTCCCGTTGGATCGGGCAAGGCGACCAAGACTGGCCTGACAGCGTTGGAGTTACAGAACAAGTTGCCCAAAGCTAGGGTCGTGTATGCGTCGGCCACAGGAGCTTCAGAGCCTAGGAACATGGCATACATGGTTAGATTGGGTATATGGGGCGAGGGTACACCATTCCCAACGTTCATGGACTTCATCAATGCTGTGGAAAAGAGAGGTGTGGGCGCCATGGAGATTGTAGCTATGGACATGAAGCTGAGAGGCATGTACATAGCTCGGCAGTTGTCCTTCCACGGGGTTTCTTTCAAGATCGAGGAGGTGCCGCTTTCGGACTCGTTCAGGGAGACTTACGATAAGGCTGTATCGCTGTGGGTGGAGGCCATGCAGCGGTTCACTGAGGCTGCGGAACTGATCGACGCTGAGCCCCGCATGAGGAAGACCATGTGGGGTCAGTTTTGGTCGGCTCACCAGCGTTTCTTCAAGTACCTCTGTATCGCGGCCAAAGTCAACCAGGCGGTGGTGACGGCACGCGAGGCCGTCAAGTGTGGCAAGTGTGTCGTGATCGGTCTACAATCAACCGGAGAAGCCAGGACTCTGGACCAGCTAGAGAGAGACGACGGGGAGCTATCTGACTTCGTCTCCACAGCTAAGGGTGTATTTCAAACGCTAGTAGAGAAACATTTCCCGGCTCCGGACCGCGAGCGCATCAACCGGCTCTTGGGGCTGGAGAAAAATAAGACGATCGCACCTCCTCCGCCAACAGTCAACGGAAAGAATGGACTTGGAGATAGTAATTCTAAAAGGAAATTATCAGCAAGGCAACAAGTCAATATGGAGGCGAAGCGTCTCAGAGGCAGGTCGTCGTCGGACGAGTTCGTACGTTCAGATGGTGAGGAGGACGGGGACGGAGAGGACCACTCCGAAGACGATCAGCGATCAGACTCTGAGCTCAGTGACTTTAATCCGTTCAAAGCGGGCAGCGATAGTGACGACGATCCATGGATTGGCAGAAAGAAGAAAGTTCCAAAGAAGAAAAAGGCAGTGAAGAAGAAAACTGCTAGCACACAGGATAAGATTGAAACTATGTTCGAACGTAAGACGCAGCCAGTGACAGTTAACTCTGGAGGCAAGACTCTCATCGGCACACCGGTGGGGACTAACGGGCTCTATCTTGGTCCGGGTCGCGGTGCTCCTCCAGCCAGATCAGCTATCGAACGAGCATGCAGCATGAAGGAACAGCTTTTGGCGGCCGTAGAGAGGCTTGGACGCCGCCTGCCGCCTAACACACTCGATCAGCTGGTGGACGAACTGGGCGGCACAGACAACGTGGCTGAGATGACGGGTCGGAAGGGTCGTGTGGTGCAAACGGAGGACGGTCAGATCCTATACGAGAGTCGGTCGGAGGCTGATGTGCCTCTGGAGACGCTCAACCTGACGGAGAAGCAGCGGTTCATGGACGGAGAGAAAGACGTCGCTATCATCTCTGAAGCAGCCTCCAGTGGTATTTCGCTGCAGAGCGATCGAAGAGCACGTAATCAAAGGCGAAGGGTCCATATTACGCTGGAGTTGCCGTGGTCAGCTGATCGAGCGATACAACAGTTTGGTCGAACACACAGATCGAACCAGGTAAACGCGCCGGAGTATATCTTTCTTATCTCGGACCTGGCCGGCGAGCGACGGTTTGCTTCAACTGTTGCCAAGAGGCTGGAGTCACTTGGAGCACTTACACACGGGGACAGACGAGCCACGGAAACTAGGGACCTCAGCCAGTTCAATATCGATAACAAATACGGTCGTACGGCACTGGAGGCGGTGATGAAGGCGATCATGAAGTACGAGCCTCCGCTGGTGCCCCCGCCGCGGGACTACTCGGGAGACTTCTTCCAAGACGTTGCCTCCGCCCTCGTCGGCGTTGGACTCATCGTGAACAGTGAAGCGGCCCCAGGGTTGCTGTCTCTTGACAAGGACTATAACAACATGTCAAAGTTTCTGAACAGGATTCTGGGAATGCCTGTGGAATTGCAGAACAGGTTGTTCAAGTACTTTACTGATACACTCACCGCCGTCATGGAGCAGGCGAAACGCAGCGGCCGCTTCGACCTTGGTATCCTTGATCTCGGTAGTGCCGGCGAATGTGTGCGTCGTGTGCGGTGTGTGAGGTTTCTTAGAAGACACGCAACGGGACAGGCCCCAGTCGAGCTACACACCGTGCAATCAGAACGAGGCATGGAATGGACTGAGGCCCTGGACAAGTTTTCTCAGGCGCAGTCTCAGTCTATGACAGAGAAAGCCGAGGAGGGCTTCTACGTGTCGGCAGCGCGCGGCGGAAAGGCCTCTGCCGTCCTCTGCCTGGCTGCGACCGCGCCTCGTGACCGACGAGAGCGCGCGGACCGCGTCGCAAAGAAGGATCGCATGTTCCACGTGTATAGACCTAACACCGGACTACAGCTGCGGCTCGAGTCGCTCACGGAGATCGAGAAGAAGTATCGCAAGGTGACGCCAGACGAAGCTGAGGCTGCGTGGCGAGCACAGCACGGGGCTTCGCTACGGGTGTGCGCGCACGCATACTGGCGCAGCACGTGCAGGGCGCGCGACACGTGCGAGGTGGGGTTGCGCGTGCGCACGCATCACGTGCTGGCTGGCTCGCTGTTAGCCGTATGGGCCCGTGTAGAAGCGGCGCTGGCTGCACGCGCTGCAACCTCCAAGATGCAGGTGGTACGCCTCAAGACAGACGACGGTTTAAAAATTGTTGGTACACTCATCCCCAAAAATTGTGTGGAGATTCTCAAGGAGACGTTGTCGTCGGATGCGGTGTCGGTGACGGAGCAGACGTTCGACACGACGGACTCGCTCCAGTGA

Protein sequence:

>DPOGS207117-PA
MSKGELSSKRSAFPAPSDDDSDFDDDEDPDNLEVPGGGKTLAAAARMGTKAKVLQPVTVKATTNPLAAPSGVAFGQAANIKPIKISANSLKKPISLLGLGAGASSSVPVMQHSRNHMNGAGMTSMNSFLLQNLNDILNSSINGVLGGSLSGVLGGSLGGSMSGSMGGSMGGSMGGSMGGSLGGSLGSSLGNSLGVPPSLAGMLVQVAQLQGGDKGGWNQVKLPGEEEEVDDEEMGVAETYADYMPTKLKLGRKHPDPVVETASLSSVEPVDVTYTLSLPDDTIRSGLLSALQLEAVVYASQAHEHTLPDGTRAGFLIGDGAGVGKGRTIAGIIFENYLKGRKRAVWISVSNDLKYDAERDLRDIGASKIEVHPLNKFKYAKISSAINGNVKKGVVFSTYSALIGETQANTKYRTRLKQLLQWCGEDFDGVIVFDECHKAKNLCPVGSGKATKTGLTALELQNKLPKARVVYASATGASEPRNMAYMVRLGIWGEGTPFPTFMDFINAVEKRGVGAMEIVAMDMKLRGMYIARQLSFHGVSFKIEEVPLSDSFRETYDKAVSLWVEAMQRFTEAAELIDAEPRMRKTMWGQFWSAHQRFFKYLCIAAKVNQAVVTAREAVKCGKCVVIGLQSTGEARTLDQLERDDGELSDFVSTAKGVFQTLVEKHFPAPDRERINRLLGLEKNKTIAPPPPTVNGKNGLGDSNSKRKLSARQQVNMEAKRLRGRSSSDEFVRSDGEEDGDGEDHSEDDQRSDSELSDFNPFKAGSDSDDDPWIGRKKKVPKKKKAVKKKTASTQDKIETMFERKTQPVTVNSGGKTLIGTPVGTNGLYLGPGRGAPPARSAIERACSMKEQLLAAVERLGRRLPPNTLDQLVDELGGTDNVAEMTGRKGRVVQTEDGQILYESRSEADVPLETLNLTEKQRFMDGEKDVAIISEAASSGISLQSDRRARNQRRRVHITLELPWSADRAIQQFGRTHRSNQVNAPEYIFLISDLAGERRFASTVAKRLESLGALTHGDRRATETRDLSQFNIDNKYGRTALEAVMKAIMKYEPPLVPPPRDYSGDFFQDVASALVGVGLIVNSEAAPGLLSLDKDYNNMSKFLNRILGMPVELQNRLFKYFTDTLTAVMEQAKRSGRFDLGILDLGSAGECVRRVRCVRFLRRHATGQAPVELHTVQSERGMEWTEALDKFSQAQSQSMTEKAEEGFYVSAARGGKASAVLCLAATAPRDRRERADRVAKKDRMFHVYRPNTGLQLRLESLTEIEKKYRKVTPDEAEAAWRAQHGASLRVCAHAYWRSTCRARDTCEVGLRVRTHHVLAGSLLAVWARVEAALAARAATSKMQVVRLKTDDGLKIVGTLIPKNCVEILKETLSSDAVSVTEQTFDTTDSLQ-