Monarch geneset OGS2.0

DPOGS214648
TranscriptDPOGS214648-TA1485 bp
ProteinDPOGS214648-PA494 aa
Genomic positionDPSCF300050 + 762420-767798
RNAseq coverage228x (Rank: top 44%)
Annotation
HeliconiusHMEL0091130.089.71% 
BombyxBGIBMGA005064-TA0.084.20% 
DrosophilaCG17446-PA4e-14048.40% 
EBI UniRef50UniRef50_E2AND09e-17160.98%CpG-binding protein n=12 Tax=Neoptera RepID=E2AND0_CAMFO
NCBI RefSeqXP_001606331.15e-17661.04%PREDICTED: similar to cpg binding protein [Nasonia vitripennis]
NCBI nr blastpgi|1565458469e-17561.04%PREDICTED: PHD finger and CXXC domain-containing protein CG17446-like isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|1565458462e-17859.88%PREDICTED: PHD finger and CXXC domain-containing protein CG17446-like isoform 1 [Nasonia vitripennis]
Group
Gene OntologyGO:00055151.2e-12protein binding
GO:00036774.1e-12DNA binding
GO:00082704.1e-12zinc ion binding
KEGG pathway 
InterPro domain[237-468] IPR0220563.2e-107CpG binding protein, C-terminal
[34-85] IPR0130832.2e-15Zinc finger, RING/FYVE/PHD-type
[24-101] IPR0110114.6e-15Zinc finger, FYVE/PHD-type
[37-84] IPR0197871.2e-12Zinc finger, PHD-finger
[128-171] IPR0028574.1e-12Zinc finger, CXXC-type
[37-83] IPR0019654.9e-10Zinc finger, PHD-type
Orthology groupMCL13277 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214648-TA
ATGAGTGAAAAGAAGTCTAAACAAAGTAAAGCAGATATTGCTAAACAGTTTGATCTACCCGAGCGTCAATCGAAGATTACGTCGCTTTTAAATCAAGCCGGGCAAGCTTACTGTATATGCCGATCTTCAGATAGTTCGCGGTTTATGATAGCTTGTGATGCTTGCGAGGAATGGTATCACGGAGATTGTATAAACATTTCTGAAAGAGAGGCGAAGTATATTAAAAACTATTTCTGTGAACGCTGCCGTGAGGAAGATCCCACTTTAAAAACAAGATTCAGGCCACAGAAACGAGAAAATGATGGAGATTCTGGTAGAGATGATAGAAAGAAAAAACGCAAAGAAAAAGATCATTCAGAAAACAAGTCATCAAAGAGACCAAATAAAGATGGTTGTGGAGATTGTGGTGGCTGTTCACAAACACACGACTGTGGCCACTGTGACGCATGCGAGGATATGTACAAGTATGGAGGTAATAACAAGTTAAAGTTGACCTGCCGTCAAAGATTGTGCGTTAAAAGCAAAAAGACCTCTAGAATATCAAGTAGTAACCGCATAAAGAAGAAGCATGAACGTGAGGTTTATGAACCCGAAGAGTCAATATCCCACCTTCAGACATCAGAGCCTCGGCAATGCTATGGACCGCAATGCTGTAAGGCAGCTCAGTATGGCTCCAAATACTGTTCCCAGCAGTGCGGAATGAGACTTGCTACCGCTAGGATTTATCAGGTGTTGCCGCAGCGCATTCAGGAGTGGTCGCTCTCCAGCTGTGTGGCGGAGCAGCATAACCGTCGTTCATTGGAGGTGGTCAGGTCGGGGCTGGCTAAAGCGCAGGCAGCGCTGAGGGCGTTAGACAAACAGCACGCTGAGATAGACGAGATGCTGCAGCGAGCTAAGCATGCCACCATAGAACACACTGATGAGAAGGAAGCAGACGACGAGACCTCAATGTACTGCATCACGTGCGGCCACGAGATACACTCGCGCTCAGCTGTCAAACATATGGAGAAATGCTTCATAAAGTACGAGGCCCAGGCCTCGTTTGGCAGCAGACATCGCACACGGATAGACGGACAGAGCATGTTCTGTGATTACTACAACCAAATCAATGGCACCTATTGTAAGAGGTTGCGGGTAATGTGCCCCGAGCACTTCAAGGATCCCAAGGTGAGCGACACGGATGTATGCGGCTGTCCGCTGGTGAAGAACGTCTTCGATCCCACCGGAGAGTTCTGTAGGGCGCCGAAGAAGTCCTGTCTGAAGCACTACCAGTGGGAGAAGCTGCGGCGGGCGGAGGTTGACATGGAGAGGGTCCGCCAGTGGCTGAGGCTGGACGAGCTGGTCGAGCAGGAGAGGAATATACGCCTCGCTATGGCCTCCAGGGCCGGCGTTTTAGGTTTGATGCTTCACTCGACGTACAACCACGAGGTCATGGAGAGGATAACGAAGGCGAACGAAAACGGAAAGGTCAAAGAGGGGTCATGA

Protein sequence:

>DPOGS214648-PA
MSEKKSKQSKADIAKQFDLPERQSKITSLLNQAGQAYCICRSSDSSRFMIACDACEEWYHGDCINISEREAKYIKNYFCERCREEDPTLKTRFRPQKRENDGDSGRDDRKKKRKEKDHSENKSSKRPNKDGCGDCGGCSQTHDCGHCDACEDMYKYGGNNKLKLTCRQRLCVKSKKTSRISSSNRIKKKHEREVYEPEESISHLQTSEPRQCYGPQCCKAAQYGSKYCSQQCGMRLATARIYQVLPQRIQEWSLSSCVAEQHNRRSLEVVRSGLAKAQAALRALDKQHAEIDEMLQRAKHATIEHTDEKEADDETSMYCITCGHEIHSRSAVKHMEKCFIKYEAQASFGSRHRTRIDGQSMFCDYYNQINGTYCKRLRVMCPEHFKDPKVSDTDVCGCPLVKNVFDPTGEFCRAPKKSCLKHYQWEKLRRAEVDMERVRQWLRLDELVEQERNIRLAMASRAGVLGLMLHSTYNHEVMERITKANENGKVKEGS-