Monarch geneset OGS2.0

DPOGS208768
TranscriptDPOGS208768-TA1368 bp
ProteinDPOGS208768-PA455 aa
Genomic positionDPSCF300036 - 926109-932278
RNAseq coverage37x (Rank: top 73%)
Annotation
HeliconiusHMEL0132153e-5140.20% 
BombyxBGIBMGA006489-TA2e-5137.29% 
Drosophila% 
EBI UniRef50UniRef50_UPI00020623861e-6040.87%UPI0002062386 related cluster n=1 Tax=unknown RepID=UPI0002062386
NCBI RefSeqXP_002166674.12e-5337.71%PREDICTED: similar to AGAP001049-PA, partial [Hydra magnipapillata]
NCBI nr blastpgi|3287192484e-6040.87%PREDICTED: hypothetical protein LOC100574892 [Acyrthosiphon pisum]
NCBI nr blastxgi|3287192484e-5840.87%PREDICTED: hypothetical protein LOC100574892 [Acyrthosiphon pisum]
Group
Gene OntologyGO:00036763e-28nucleic acid binding
KEGG pathway 
InterPro domain[298-449] IPR0048753e-28DDE superfamily endonuclease, CENP-B-like
Orthology groupMCL11171 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208768-TA
ATGATGAAGAAATCTGTTCCTCGTGGAACTTATGAAAAGCAGATGACTGAATTTGACGGTATTGATTTAACAGCAGTTACTTGGAAGGACAATAAAGTTGTAACTTTTCTATCTTCATATGTTGGAGCTGAACCTGTGGGCCAAGTAGAAAGATTTGACAAGGCAAATAAGACTAGAATTAAAATTTCATGCCCACATATAATAAAGGAATATAATGCCCATATGGGCGGAGTGGACCTAATGGATAGCTTTATTGGGAGAACCAGGGGAGCTGGATCTATTTTGCCTAGTGAAAAGGCAATGAAAAAGAAAAAGAGAGGAGCTTATTCAAAGGTCGTATGTGACAAAACCAAGCTGGCTCTCGTTCGCTGGAACGACAATAAGCCTGTCACCCTGATAAGTTCATTTGTTGGATGCACACCTGTGCATAAAATAAAGAGATATTGTAAGGTAGAAAAGAGAAAAGAACAGTTTCGCAAAGTATGTTACAGTGTAGCTAAGCAACTAGGTATTGAAAAACGGTTCAACCAAGTCAATCAAACTGCAGGAAAAGATTGGCTTGCTGGATTTTTTCAGCGTCATCCAGATTTGAGTATTAGAAAGCCTGAAGCCATCAGTATCAACCGAATCCTTGGATTTAATAAGACTGAGATAACGCTGTGGGACCTACATTACCTACCACACGCAGCGTCCATACAAAACATCAAGCATATGACCGTCACGAGCCGAAGGCGCGCGGTTACTAAAACGCCGGGTTTTGTATGGACACAGATAGGCGCGTATTTATATGGTGCCCCCCGCGGCGACGGTGCGCCCCTCACCTCTCGTACAATTCCATCAACAACGCTTTTTTTCAATAACTTGGAGAAATTAATGGAGCAACATAACTTCGAGCCTCAAATGATTTACAATGTAGACGAGACTGGTATAACGACCGTCCAAGAGACCGAAAAAATTATAGCCCTCAAAGGACAAAAACGCGTCGGGTCATGGGAAAGAGGTAAGACGGTCACTGTTATATGTGCTGTAAGTGCGTCTGGATCGTTCGTTCCTCCGCTTTTCATATTTCCTCGTCAACGTCACTCACCACAGCTGGAAAAAGATGGACCAGTGGGAGCCGTTTACACATGTTCACACAGTAGCTGGACTAACGAAAAGATATTCGTTTTATGGCTTCGTCATTTTATCAAACACACCAAGCCTTCTGCTGAAACACCCGTATTGCTAATATTGGACAATCATAACAGTCATGCTACTCTCGAAGCTTGGGAATTAGCAAAAGAGAACCATGTCATAATGCTAACCATTCCTCAGCATTCATCCCACCGTCTACAGCCCCTTGACGTTGTCAGAGGAGCATTTCCCTAG

Protein sequence:

>DPOGS208768-PA
MMKKSVPRGTYEKQMTEFDGIDLTAVTWKDNKVVTFLSSYVGAEPVGQVERFDKANKTRIKISCPHIIKEYNAHMGGVDLMDSFIGRTRGAGSILPSEKAMKKKKRGAYSKVVCDKTKLALVRWNDNKPVTLISSFVGCTPVHKIKRYCKVEKRKEQFRKVCYSVAKQLGIEKRFNQVNQTAGKDWLAGFFQRHPDLSIRKPEAISINRILGFNKTEITLWDLHYLPHAASIQNIKHMTVTSRRRAVTKTPGFVWTQIGAYLYGAPRGDGAPLTSRTIPSTTLFFNNLEKLMEQHNFEPQMIYNVDETGITTVQETEKIIALKGQKRVGSWERGKTVTVICAVSASGSFVPPLFIFPRQRHSPQLEKDGPVGAVYTCSHSSWTNEKIFVLWLRHFIKHTKPSAETPVLLILDNHNSHATLEAWELAKENHVIMLTIPQHSSHRLQPLDVVRGAFP-