Monarch geneset OGS2.0

DPOGS215869
TranscriptDPOGS215869-TA2613 bp
ProteinDPOGS215869-PA870 aa
Genomic positionDPSCF300029 - 666859-677354
RNAseq coverage6x (Rank: top 87%)
Annotation
HeliconiusHMEL0215312e-6352.63% 
BombyxBGIBMGA000417-TA3e-7332.74% 
DrosophilaCG7031-PA2e-1447.87% 
EBI UniRef50UniRef50_E2C2Z12e-2359.55%Putative uncharacterized protein n=1 Tax=Harpegnathos saltator RepID=E2C2Z1_HARSA
NCBI RefSeqXP_001121697.15e-2557.30%PREDICTED: hypothetical protein [Apis mellifera]
NCBI nr blastpgi|3407186871e-2361.80%PREDICTED: hypothetical protein LOC100646832 [Bombus terrestris]
NCBI nr blastxgi|1700378515e-2858.76%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00036764.4e-08nucleic acid binding
KEGG pathway 
InterPro domain[754-802] IPR0048754.4e-08DDE superfamily endonuclease, CENP-B-like
Orthology groupMCL21189 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215869-TA
ATGTACAATAAGTTAATTTGGTTAGCTATTATCGCGTTAATGCCCGCTGAGGGTAAGAAATCGAGAAGCAATGACAGTGATTCTGTAGAATCGACACCACAACCGAATATTCTTTTATATTCGACATTCGGTTTCAATGATGTCGGTCCTCACGATGGATTTGTACAAAGTTCACCGGATTACGCCACCTATCCTAACAGACAGCAGGATTCCCGAGTGAGACTTTACGCACCAGCTTTTCCATCGGCTATAGAGGCCACCGGTTCAGAGGGCTATTACGATCCTGACCAAACACCACTCGGTGCTGATAACGAGGACCAAACAAACGTTAACATATTCCAACCGGTTCCTATAAATTTCTTTACACCCAGTCATGAATCCTTCAACGAGAAAAAGAATACGGATACAGAAAATACATTTGATACGAGAGACGATAGCTCAAATCATCCAATATACGGGACAAAAATTAATTCTAAAGCCAGAAATAAACCAGCCAAAGACTTGAATACCACTGAATATAATATTTTGGACAGTGTATCCACGGTCATTGAAAATAATAATGACGACTATAATACTAATCTAGAATCTAGACATCAATCGAAATCTGTGGAAAATCAATACAATGCTAACTATCCATCCATAAATTCTCCAGGCTCATTTCCAAAGGTAATTGATTTCACCGATGCTAAGAAATATTATCCAAGTAGCGTCGAAAGTAAATATACAAACCTTAAACCTATTTATGTGAACGCCTTCCAAACAGACGAATTTAATAGTAAAGATGAGGTTAATAATGATCAGGGAAGCACTAATCAAGTGTCTTCATATAATATTAACGGAAAAGACAGCGATGACTTTACAGACAAAGAGACGTCACCAGTGAGCTTTTCGAATCAAGACCATTTACCTTCTTATTCTAACGTTAAAAACAATTTGAAGACCAACAAATATAAACTAGATTACAAACAGAAATTAAAAACAAAGTACTCCAACAATAACAACGACGAATTTAACAAATTGCGAGACCTAAAAAAAGGTTATGAGTATTCAACTAACTACAGCACCGCCAGTTTTCATTACGACTATGACAAACCTAAGAGACAGTTTAATAGCAGCATAGATGAAGTGTCACCTGAGAGTAGCAATATAAATATTGTGGGTCACCGATATCCAAATAAAGAATTTTCCAGTTTAAGAACTTTACCATTCTCGTTACCAGATCCTGATATTTACAAACCATCAGAGGAATATTTAAATGCGTTTAAAAATTATTACTCCGATGTACCTTCTACTGCTTCTCAATGGAGTAGCTTTTATAAGCCAATTGAATTGTCATCTCATAAGAAACAACCGAAAAAAACATATTTATTCGATGATGACAATGAAGAAATCGTTCACATACCGAAAAGGCCACACAGTAGTAAATATGGAATGTATTCTGACAGTGCAATAAACGAATGGCCGTATGCAAACGAGTACACATCCCATAGTCCTAGCGTAACAAAACAACCAGAATGGGCTAAAGATTATATTAGAAATAAATTCAAAACAGAAGAAGATCTTTTAGGACTACGAAACCACGATGACTTTTCTCCGTCTCATCCGAGCTCATTCAAGTATAACGATCTGACGAAAGAATTTGATTTCGAAAATTTATCAGAGAGGTGGAGACAAAATTTTTTAAAATCAAAGGCTAAAGAATCTAACCGTGACTACGAAAGTTACGCATCAGATACTAAACCGATTCACATATCTCGTCCCAAACCATACGCCATTGAGATACCTCACCCGGTGATAGTGCCGGTGCCGCAGCCGTATCCCGTTAGGGTTCCGATACCAAAACCGGTAGCCGTACCGGTTATACAGGAGCTTACCGTACCGGTTGAGAAGCCGATGCCGTATCCGGTCATCAAAAAGGTTCCTTATATCGTAGAGAGGCCCGTTCCCGTACCTGTTGAAAAACAGGTGCGTGTCCCGGTAGTGAAGCCTTACCCGGTGCACGTCCCTCAAGTGAGGCCAGTGTTTCATCACTCACAGTACGAAGACGATGACGAATATGAAGCCCGCCCGGACACCAGGCGACCTGTCTACAAACGACACAAGAATACACATGGGCGCGCTCGTCCGACAACAAGACGGCCAACACGTACGGCCTATCGTGACAACCAAAGACGACGTCCAGTCAGAAAACCAGGGATTCAGGGAGGCAGAAAACATCTGAGAAGACCTACAAACTATCAAACAGAACACTCCAGAACGGTCATATCTAAAACAGAAAAGTCTGCGCCAGGATTTAAAGCTAGCAAGGATAGACTGACTCTTCTTCTTGGTTGTAATGCAAATGGTGATTTCAAATTTAAACCATTTCTCATTTATAAATCAGAAAATCCAAGAGCTATCAAAGGTTGTTCAAAGAATCTGCTTCCTGTCCTGTTTTGGAAAAAGTTCAGCATTATGAAAGCCATAAAAATCAGTTCTGATTCGTGGGAAGAGGTCAAAACATCTTGCATGAATGGTATTTGGCGCAAAATATGGCCTAAATGTGTTAACCGAAGCGCTGCGGAAGTCTATGGTACTCCAGCTGTTCGTCATGAAATTTCCAACTTAGCTTGA

Protein sequence:

>DPOGS215869-PA
MYNKLIWLAIIALMPAEGKKSRSNDSDSVESTPQPNILLYSTFGFNDVGPHDGFVQSSPDYATYPNRQQDSRVRLYAPAFPSAIEATGSEGYYDPDQTPLGADNEDQTNVNIFQPVPINFFTPSHESFNEKKNTDTENTFDTRDDSSNHPIYGTKINSKARNKPAKDLNTTEYNILDSVSTVIENNNDDYNTNLESRHQSKSVENQYNANYPSINSPGSFPKVIDFTDAKKYYPSSVESKYTNLKPIYVNAFQTDEFNSKDEVNNDQGSTNQVSSYNINGKDSDDFTDKETSPVSFSNQDHLPSYSNVKNNLKTNKYKLDYKQKLKTKYSNNNNDEFNKLRDLKKGYEYSTNYSTASFHYDYDKPKRQFNSSIDEVSPESSNINIVGHRYPNKEFSSLRTLPFSLPDPDIYKPSEEYLNAFKNYYSDVPSTASQWSSFYKPIELSSHKKQPKKTYLFDDDNEEIVHIPKRPHSSKYGMYSDSAINEWPYANEYTSHSPSVTKQPEWAKDYIRNKFKTEEDLLGLRNHDDFSPSHPSSFKYNDLTKEFDFENLSERWRQNFLKSKAKESNRDYESYASDTKPIHISRPKPYAIEIPHPVIVPVPQPYPVRVPIPKPVAVPVIQELTVPVEKPMPYPVIKKVPYIVERPVPVPVEKQVRVPVVKPYPVHVPQVRPVFHHSQYEDDDEYEARPDTRRPVYKRHKNTHGRARPTTRRPTRTAYRDNQRRRPVRKPGIQGGRKHLRRPTNYQTEHSRTVISKTEKSAPGFKASKDRLTLLLGCNANGDFKFKPFLIYKSENPRAIKGCSKNLLPVLFWKKFSIMKAIKISSDSWEEVKTSCMNGIWRKIWPKCVNRSAAEVYGTPAVRHEISNLA-