Monarch geneset OGS2.0

DPOGS209471
TranscriptDPOGS209471-TA2469 bp
ProteinDPOGS209471-PA822 aa
Genomic positionDPSCF300275 + 231580-234358
RNAseq coverage599x (Rank: top 21%)
Annotation
HeliconiusHMEL0044952e-11762.33% 
BombyxBGIBMGA005851-TA2e-13043.08% 
DrosophilaBsg25D-PB1e-1027.92% 
EBI UniRef50UniRef50_B4JPK95e-4226.46%GH13539 n=7 Tax=Drosophila RepID=B4JPK9_DROGR
NCBI RefSeqXP_969726.21e-5726.25%PREDICTED: similar to Blastoderm-specific gene 25D CG14025-PC [Tribolium castaneum]
NCBI nr blastpgi|1892409272e-5626.25%PREDICTED: similar to Blastoderm-specific gene 25D CG14025-PC [Tribolium castaneum]
NCBI nr blastxgi|1950505353e-5426.38%GH13539 [Drosophila grimshawi]
Group
KEGG pathway 
Orthology groupMCL25178 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209471-TA
ATGGATAGTCCTATGAATCGCTACGAACAACAGCTGTATAGTGTGTTCAAAACCTTCGACATAGATAACGAGGAAGCCTTAAAAAGGTCAGCAGTACTTCAGCTCTGTGACTCTTTACAGCTTGAAGATCGCGGCGCGGCGCTCGTGGACACTCTCTTTGACCGACAGAGCGACCGTATCACATTTTCACAATTCCGCAACGGGTTGCTCTCCGTCCTAGATAAAGATCCCATAAGACGCCAAGAGAACTGCTCGAGTAAGGAGAGTGTCGTCCCCAAGACCCCGAGCCCGGCCCAGAGCGACGACGACTCAAGAGAGGTGGGTGCTAAATTCGCTTTCGGTTCGGAGAGGTACGGGAGACGATCACGGCCTCATCGCGTCACCGCGGAAGACACACCGCGTCCTCGGGTGGCGTCGGAATCTCGTCTCGACAGTGTGCGGGCGAGGCGACGTATGAGGTGCAAGAGGAGCTCGTCTGCGATGGAGAGTCGCGAGGACGACGTGACTGGTCTGGAACTAGACCACGAGCAACGCGTCGACCGAGATCGAGCTCTCGCCCTGTGCCGGGGACTCGACATGCATGGAGTCGACCGGTTTCTAGTCGACCGCGTCTTTGCAGCCTCCGATGAGAATGAGGTGACGGTGGGCGAGTTCTTCGATCGACTGAACGCCTCGCTTACGACTTCCATTGAAACGTCTCTAGATGGAGGTATTGTGGAAGGTGATTCTAAATCAGATTCTGACATTGTCTCGAGTGATACAGTCGTAGAGACCTGGAAACGAGCTGGTGTCCGAAAACCGATGGGGTTATTATTAGAGCTAGGTTTCGGAGAAGCCGCATTACGGTTGCAGTCTCTCGAACACGTGCTCGACGATGAGTTACGAGCAAACTCCTCGCAAACGGACTCACGTTTGCCGCTAGCCGCATATTCACTATCACGTATACGTCTAGAATCTGTCGGGCGACTCTTGGAAGTCGCTCGTGGTGAACGGGACAAACTCCGTGAAGACCTCGGTGAAGCAAACCGACGTGCCAGGCTGCTAGCACAGGACGTGGACGAGAATCACGATCGCATCGAAGCCGAGCTGAAGTCGAGGCTGCGTCAAATGGAGGCCCGACACGCCGAGGCCGTTCGCGTTGCGGCCGCCGAAGCCGGGGTCGAGAGGGAGCGAGCCGCTGCAGCGAGAGTCGCTCTAGAGGAGGAAGCAGCGCGTCGGGCGGATACAGAGATTCGTCTTCGGGACGAAATAACTTCACTTCAAGAACGCATCGAAGAATATCAGGAGCGTGCTGCGGCTGCAGAGGCTCGGTGTGAACACGCGGAGCGCGAGCAGAGTAGGCTGCAAGAGGAATTGCGGCGGGCCGAGGAGCGTGACGCAGAGAGGGCGTCGCGGGAGGAAATCGCTAGGGACGAGCTCGAGACTCGCTTAAGAGAATCGAGGGAGGAGCGAGCTCGCCTGAGGGACCGCTGCGACGAGCTCGGCGCAGCGCTCGAGGTGGCGGGGGCCACGAGGTTGGCGAGGGAAGGAGCCGCGCGCTGCTGGCGGGACGAGGTGGATGCGCAGGCGCCGGACTCGTTGCCTGATGTGTGTGACCTCTCGCTGCAATCTTTTGATAAGAAAGATGCCCAGGAAATAATAGAGAAGTTGATGACGTTTCTGACTAATATACGAGAAATATCAGTTGGAGACGGGAACAGCTGCGCCTCGTGCGACCATATTGCGAAAACCGCTGCGACTTTCCGGGAAGCGTTAAGAAGCTCAAGATATGAAGGCGCAGACACGATAGGACATAATCACGTGACGGAGAGGGACGACGAAGGGGCCCAGACTGACCTCGAGAGCGTCCAGGAGGTGGAGGCTCTGAAGAAGACACTCGCGGACCTCGAGCACGATCATCGCGTGGAAAAAGATAGTCTAACGAACGTCGTAAAGGAACTGGAGACAAGTTTAGAGCAAATGAAAACGGAATACGAAAAATGCGAGGAATACTGGAGTGACAAGTTGGAGGAGGAACGGGAGGCATTCGCGGAGGAGCAGCGCGCCGGGGATGAGAGGCTGGCGGACCTCGCGGCGCGCATCGCGGACTACGAGAGACAGTTCGCGCCCGCGCCGCTACCCACCATCGACGAAAGAGATCAGCTCGAGCTGCAGGTCAACCAGCTGCAGGACGAGTTCGACACCTACCGCAGGACTCATGAGGCCGAGCTCGCCGCTAAGTCGGAGGAGCTGCGCCGACTTCACCGGCGGCTGGAGGCGGCCGAAGGCGCATCCTGCGTGTGCGGCGGTGCGGCGGCGGCTCGATGGCGGGCGGCCGTGTGGAGGGAGGCGGGGGCGCTGCGGGCTCGTGCTGGGCGGGCAGAGCGGGCGGCTCACAGGCTCCACGCGCGACTCGCGGCCGCCGACCTTTTGGTCAAGGACCTCTACCTCGAGAACTGCCGCCTGGCGCACGGACCGCGCCTGCCCTGA

Protein sequence:

>DPOGS209471-PA
MDSPMNRYEQQLYSVFKTFDIDNEEALKRSAVLQLCDSLQLEDRGAALVDTLFDRQSDRITFSQFRNGLLSVLDKDPIRRQENCSSKESVVPKTPSPAQSDDDSREVGAKFAFGSERYGRRSRPHRVTAEDTPRPRVASESRLDSVRARRRMRCKRSSSAMESREDDVTGLELDHEQRVDRDRALALCRGLDMHGVDRFLVDRVFAASDENEVTVGEFFDRLNASLTTSIETSLDGGIVEGDSKSDSDIVSSDTVVETWKRAGVRKPMGLLLELGFGEAALRLQSLEHVLDDELRANSSQTDSRLPLAAYSLSRIRLESVGRLLEVARGERDKLREDLGEANRRARLLAQDVDENHDRIEAELKSRLRQMEARHAEAVRVAAAEAGVERERAAAARVALEEEAARRADTEIRLRDEITSLQERIEEYQERAAAAEARCEHAEREQSRLQEELRRAEERDAERASREEIARDELETRLRESREERARLRDRCDELGAALEVAGATRLAREGAARCWRDEVDAQAPDSLPDVCDLSLQSFDKKDAQEIIEKLMTFLTNIREISVGDGNSCASCDHIAKTAATFREALRSSRYEGADTIGHNHVTERDDEGAQTDLESVQEVEALKKTLADLEHDHRVEKDSLTNVVKELETSLEQMKTEYEKCEEYWSDKLEEEREAFAEEQRAGDERLADLAARIADYERQFAPAPLPTIDERDQLELQVNQLQDEFDTYRRTHEAELAAKSEELRRLHRRLEAAEGASCVCGGAAAARWRAAVWREAGALRARAGRAERAAHRLHARLAAADLLVKDLYLENCRLAHGPRLP-