Monarch geneset OGS2.0

DPOGS208752
Transcript: DPOGS208752-TA (4362 bp)
Protein: DPOGS208752-PA (1453 aa)
Genomic position: DPSCF300043 + 535264-543634
RNAseq coverage: 423x (Rank: top 29%)
Annotation
Heliconius       HMEL015211         0.0     60.40%
Bombyx           BGIBMGA003413-TA   3e-105  56.21%
Drosophila       MBD-R2-PB          1e-79   30.16%
EBI UniRef50     UniRef50_D6WKQ6    3e-93   33.08%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WKQ6_TRICA
NCBI RefSeq      XP_001121736.1     6e-99   34.25%  PREDICTED: similar to MBD-R2 CG10042-PA, isoform A [Apis mellifera]
NCBI nr blastp   gi|91094865        9e-93   33.08%  PREDICTED: similar to phd finger domain [Tribolium castaneum]
NCBI nr blastx   gi|270006581       2e-97   32.81%  hypothetical protein TcasGA2_TC010453 [Tribolium castaneum]
Group
Gene Ontology
GO:0003677   3e-18    DNA binding
GO:0005634   2.6e-14  nucleus
GO:0003676   6.3e-14  nucleic acid binding
GO:0005515   7.7e-05  protein binding
GO:0008270   7.7e-05  zinc ion binding
KEGG pathway 
InterPro domain
[732-859]     IPR016177  3e-18    DNA-binding, integrase-type
[733-836]     IPR001739  2.6e-14  Methyl-CpG DNA binding
[4-86]        IPR006612  6.3e-14  Zinc finger, C2CH-type
[1167-1237]   IPR011011  5.9e-12  Zinc finger, FYVE/PHD-type
[225-290]     IPR016197  1.2e-10  Chromo domain-like
[1175-1233]   IPR013083  4.1e-09  Zinc finger, RING/FYVE/PHD-type
Orthology group: MCL16432 (Insect specific)
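
Several of the InterPro hits listed for this protein cover overlapping residue ranges, so the six entries describe fewer distinct regions. The sketch below groups them, assuming the bracketed coordinates are 1-based, inclusive residue positions in DPOGS208752-PA (the record does not state this). The names `HITS`, `merge_regions`, and `REGIONS` are illustrative and not part of the database.

```python
# InterPro hits copied from the record: (start, end, accession, description).
# Assumption: coordinates are 1-based, inclusive residue positions.
HITS = [
    (732, 859, "IPR016177", "DNA-binding, integrase-type"),
    (733, 836, "IPR001739", "Methyl-CpG DNA binding"),
    (4, 86, "IPR006612", "Zinc finger, C2CH-type"),
    (1167, 1237, "IPR011011", "Zinc finger, FYVE/PHD-type"),
    (225, 290, "IPR016197", "Chromo domain-like"),
    (1175, 1233, "IPR013083", "Zinc finger, RING/FYVE/PHD-type"),
]


def merge_regions(hits):
    """Group hits whose residue ranges overlap into merged regions."""
    regions = []
    for start, end, accession, _description in sorted(hits):
        if regions and start <= regions[-1][1]:
            # Overlaps the previous region: extend it and record the accession.
            prev_start, prev_end, accessions = regions[-1]
            regions[-1] = (prev_start, max(prev_end, end), accessions + [accession])
        else:
            regions.append((start, end, [accession]))
    return regions


REGIONS = merge_regions(HITS)
for start, end, accessions in REGIONS:
    print(f"{start}-{end} ({end - start + 1} aa): {', '.join(accessions)}")
```

Under that assumption the hits collapse into four regions: 4-86, 225-290, 732-859, and 1167-1237.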
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208752-TA
ATGGCTGTAAAAAAATGTTGTGTAGAAAACTGTAATTCATCTTCAACAAGACCAGAAGACATTGGTGTTACATACCACAAGTTCCCTAAAGATAAGACATTACGTGATTTATGGTCGTTGGTAACGCACTACAAACAAACTAACATAGACTCTACGACATACGTGTGCTCGCGTCACTTTTGCAAAATCGATTTTCAAATTTACGAGGACTCAAAATACATTCTTAGATCAGATTCCATTCCCTCTATATTTTCATGGATCCAAAGAGATAAAGATACAAAGATACAGCAATTAGAATCTAATATGGATGAACCTAATATTTCTGGGGCAGCATCCCCTGTAAATGAAAGCTCAGATGGAGGTGCTGCAAACCTGAACACCTCATCCAGCAGTAAAGAATCCGAAGGTGAAAATGTTGAAGCTATTATGAAATTCATTGAAGAACAAGAACAGGAAATAAAAAAGCAACAGAATGAGAGCCAATCTGAAAAGATTGCAAGCGATCAAAATGTGCCGCTAACCCATAATGACAATATTGACAATAATGACAATAATCAAGTGTTCAGTGACATAGCGGAGCCTATGGTGATAGCGACGAGTGTTATGGACATGATTCTCAGTGAATCAGAAGCTAAAATAGACACAAGGAAAAATATTAAACCAATACCACAGAAACTGAGTAAAAATGATAAAGGCAACAGTGTTTCACTGTCGGTCGGATCTAAGGTTGAGGCCAAAGATTATGGAGAATTTTGGCATTCGGCTCAGATTGTGGAAGTGGACTATGACGAAATGGAAGTTCTGGTGCATTATGAGAACACACACAACAAACCCGATGAATGGATAAGTGTGAGCAGTCCCAGATTGAGGCTTACGAACAATCCTACACAAAGCACCCCAGCGAGAAACGTCAGGACTGAAATAAAACCTGAAAAGGAAGAAGTTAAGGTGGAGGAGAAACCAAAACAACAGTTTGTTGTTGGTGAAAGATGTCTGGCTCGTTGGAGGGACAACAGACGTTTCATAGCCACAATACTAACAGATCTCGGCAATGGCAATTACGAGATTATGTTCGACGATGGTTTCAAATGGAAATGCACTACGTCAAGGATGTGTAAGCTCAAGGAGTCTAAGACGGAACCGCTGGCCATCGATACGTCAGCGTCAGCATCATCTTCCAGTCTTTCACCGATACCAATTCCTGGGACGGGTCCGACGGGAAATATCCCAAACAGCCAATACACGTTCCACACACATCTATTCGATCCCACCCGCGATTATTTGGGCTCTAAGAGCGAGAGGCGAGAAATGAAACGCAAATTGAACATAAAAGAGATATTTAATATAGGTCAGAAGAAACAAAAGCGAAAAGATAGCGATCAAGGGAAACCGAAAATTGGTAAGGTGAAGAAGGCAAGGGTTATTAAGAAGAAGGTTGACATTAAACCGGAAGCTGAGACCGAAGTCAAGCTTGAAGTGTCGCAAATAACAATGGAGATAAAAAAAGAGATTCCCGATGCAGTCGCTTCGATAATTGGTACTGTCAGTAAAGATGAAAGCGATAAAACTAAAATTGATACAGAAATTAATATACCGATGGACGCTGCTAAGGACGTGGAAGAATCTAGTGCTGAGACTAACATTGAAGACGCAAATGTATTAGACGTAAAAAATACAGGAACGGCTTTCGATTCAGAGGAGGTCGTTGAAAAAAATGACATCGGAGATAGTTTAGTTAGTAGCTCAGATCTGTTAGTCCCTAGCGACTTAGGCTTCCAATCAGAGCCTACATCGGATCCTGTCATGGAAGAAGAAGCGAAATTAGAACAATTCGAAAAACCGGATGAAAGTAAACAAGAGGTTATAGAGAAAATGAAAGAGGTTATATGCAAATTAGAGGGCGGTTTAGATATACATAAGATAGATACAACGAAACGTGAAGTAGACACGAAGCCAGTAAGCGAAGTTGATACAACAGTAAAAGACGACAGATCGAAAAG
GAAGCTGTCGAAAATAAAGAGGAATAAAAGATTGAGAATGTTACAGGAGAAGAAAGTTAAGAAGCAGGTGGAGAAAGTGAAGAACGAGCTGGTGGAGATGAGGAAGCAGATGGAGGAGATGAGGAAACAGATGTTGATGAAGACGGAGGAGATGGCTCGCCCGCACGAGATGCCCGAGAGCTTCCTGCTACCGGGAGAGTGGTGCTGCAAATGGCTCAACGGGCAACCCCTGGGCAATGTGTGCGAGTTTGAAGATAAAGTCGACGGCAAAGGGCTTAAGAAGATGAGCGTTCAGGTCGAGGATAAAAGACTGCCTCCAGGGTGGACGAAACTTATGGTGCGTCGAAGTTTTGGACAGTCCGCTGGGAAATGGGACGTCGTTTTAGTTGGACCGGAAAATCGTAGGTTTCATACTAAGACGGATATACGGAACTATCTCGAGCAGCACGACGACTCCCTCAAGCAGTACGAACACGCGCTGTTAGATTTCGGTGTACATCTGAAGCTGTCCCGTAGGATGGGATGGTATACGACGGATGGCGGCGTTGCACCGGCACTGGTGAAAAGAAAGAAATTAGGTATAAACAGGAAGGAAGGAAAGAAAAGAAAGAAAGAGAAGTCAGCCAAGCGTGATATATCCTTGGAAAGTTTTTATAAACGTACATTTTACCCGGAAAGCCCACCGGTCTTCCTGGAAAATCCCGTGGAAGACGATGGTTCTGTGTACGTCGGTTCTATGAAGGTGGAAGTGATCGATAATCTCTTACGGTGTCCAGCTGAGGGCTGCCTGAAGAACTTCCGAAATACCACACTACTGCAGATGCACATAAAACATTACCACAGAGAAATGAGGGAAATGTTGGGAGCCACCCCAAAAGTTTTGGACTTAGCGCGCGAAAGAACGAAACCCACTGATATCGAAGTCAAAAAGACGGAATTTGAATCCAAAATTATTAAAGTCAAGCTACCAAAACTGCCGAAGAGATCCGAGGAACCCAAAAGTCAAACGAACCCAGAAGTCAAAGAGCCCATTGTGCAAAAAGTAGAGCCTCGACCACAAACACCACCTAAACTGGATGTGCCCATACCTAGATCACAGGATTCTCCTAAACTAAGACAAGCACTAATCACCAAACCGGCTAAAAGACCGAAAGTTCTACTCCCAGTTAGAAAACCAGAGCCAGAAGAAAAAGAAGAAATCCCCGAGGAAGCTGATGTAGAAAAGATAGATTTCGACGACAGCTCCAATACTGCAGAGAAACCGTTTGAGGAGTTCCGAAGGAAGTGTGATAAAAAGCGCAAATGTTTTTCAACTGTGTCAAGGAAGCCTATCAGCGAGGAGGACGAGTGGTTCGGTGTGAACTCTGACCTTGACACTCGGTCCAGTTTCCCAGGGTCTGGCACACCGGACTCCAAAAACATGGACAAGGCAGTACCACTTCCGGTTTCCTCCGAGTCCAATGAAGAACAGAAGGACGGCAATATGTATATGTATACAGAGACTGGCGAACGTATAAAGATCGAGCACATGAAACGCGAGGAGATCATAAACTGTCATTGCGGTTTCCGCGAGGAAGACGGGCTGATGGTGCAGTGTGAACTCTGCCTGTGTTGGCAACACGCGCTGTGTCACAACATACAGAAGGAGTCGGAGGTTCCAGAGAAATACACTTGCAGTATATGTCTCAATCCTCGGCGTGGGAGACGCTCCAAGCGGTTCTTGCACGATCAGGACAGACTGTACGAGGGGTTGCTGCCGGGGGCGAAGCCCTGCGAGACTTTGCGACGCTCTCACGAATTATCAGCTAACCTATTGAAAATTCAGGATGCTCTGCATGCAATGCGAGTCAAACACTATGTAGCTACTAAGAAAGACCACCCAAAATTATATCTGTGGGCCAAAGACTGGGAGAGTCCAGAGGTAAATTTCACCCAAGAAAGACTTAATTCAGATTACTCAGATCTGAATATTATCATAAATAACATCGGCAAGGAGAATT
TGCCGCTGAAACCCGATGAAGTTAATCCACATCTGGATATAAGAATGCCCATAACTGAAGAGCCTGAAGATAGATTCACTCAGAGAGACAAACAAGAAGTACAAAGAGTGGTCATCCCTCAGCCCGAGGCAGCCATTGAGAACAGTGCATGCAGGGAACGCTTGCTGCGACATGTGCAGCGCTGTCAGGGCTTCATTGACGCCAGACTCGATTCTATAGAAGCTCAAGTAGCCGAACTCGAATCTCAAGATCCATCATTTGAGGATGATGAGACAGCGGATTTCTTCCCAAGAACAAAACAAACTATCCAAATGCTGATGAGGGACCTCGATACGATGGAAGAACTGGGAATTATATCTTGA

Protein sequence:

>DPOGS208752-PA
MAVKKCCVENCNSSSTRPEDIGVTYHKFPKDKTLRDLWSLVTHYKQTNIDSTTYVCSRHFCKIDFQIYEDSKYILRSDSIPSIFSWIQRDKDTKIQQLESNMDEPNISGAASPVNESSDGGAANLNTSSSSKESEGENVEAIMKFIEEQEQEIKKQQNESQSEKIASDQNVPLTHNDNIDNNDNNQVFSDIAEPMVIATSVMDMILSESEAKIDTRKNIKPIPQKLSKNDKGNSVSLSVGSKVEAKDYGEFWHSAQIVEVDYDEMEVLVHYENTHNKPDEWISVSSPRLRLTNNPTQSTPARNVRTEIKPEKEEVKVEEKPKQQFVVGERCLARWRDNRRFIATILTDLGNGNYEIMFDDGFKWKCTTSRMCKLKESKTEPLAIDTSASASSSSLSPIPIPGTGPTGNIPNSQYTFHTHLFDPTRDYLGSKSERREMKRKLNIKEIFNIGQKKQKRKDSDQGKPKIGKVKKARVIKKKVDIKPEAETEVKLEVSQITMEIKKEIPDAVASIIGTVSKDESDKTKIDTEINIPMDAAKDVEESSAETNIEDANVLDVKNTGTAFDSEEVVEKNDIGDSLVSSSDLLVPSDLGFQSEPTSDPVMEEEAKLEQFEKPDESKQEVIEKMKEVICKLEGGLDIHKIDTTKREVDTKPVSEVDTTVKDDRSKRKLSKIKRNKRLRMLQEKKVKKQVEKVKNELVEMRKQMEEMRKQMLMKTEEMARPHEMPESFLLPGEWCCKWLNGQPLGNVCEFEDKVDGKGLKKMSVQVEDKRLPPGWTKLMVRRSFGQSAGKWDVVLVGPENRRFHTKTDIRNYLEQHDDSLKQYEHALLDFGVHLKLSRRMGWYTTDGGVAPALVKRKKLGINRKEGKKRKKEKSAKRDISLESFYKRTFYPESPPVFLENPVEDDGSVYVGSMKVEVIDNLLRCPAEGCLKNFRNTTLLQMHIKHYHREMREMLGATPKVLDLARERTKPTDIEVKKTEFESKIIKVKLPKLPKRSEEPKSQTNPEVKEPIVQKVEPRPQTPPKLDVPIPRSQDSPKLRQALITKPAKRPKVLLPVRKPEPEEKEEIPEEADVEKIDFDDSSNTAEKPFEEFRRKCDKKRKCFSTVSRKPISEEDEWFGVNSDLDTRSSFPGSGTPDSKNMDKAVPLPVSSESNEEQKDGNMYMYTETGERIKIEHMKREEIINCHCGFREEDGLMVQCELCLCWQHALCHNIQKESEVPEKYTCSICLNPRRGRRSKRFLHDQDRLYEGLLPGAKPCETLRRSHELSANLLKIQDALHAMRVKHYVATKKDHPKLYLWAKDWESPEVNFTQERLNSDYSDLNIIINNIGKENLPLKPDEVNPHLDIRMPITEEPEDRFTQRDKQEVQRVVIPQPEAAIENSACRERLLRHVQRCQGFIDARLDSIEAQVAELESQDPSFEDDETADFFPRTKQTIQMLMRDLDTMEELGIIS-
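
The transcript and protein records can be checked against each other: 4362 bp corresponds to 1454 codons, that is, 1453 residues plus one stop codon, which the protein record writes as a trailing `-`. The following is a minimal sketch of that check using the standard genetic code. The function names are illustrative, and only the first ten and last four codons of DPOGS208752-TA are included to keep the example short.

```python
# Standard genetic code, built in TCAG order; "*" marks stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop="-"):
    """Translate a coding sequence, writing stop codons as `stop`."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(n_residues):
    """Length in bp of a CDS encoding n_residues plus one stop codon."""
    return 3 * (n_residues + 1)


# Excerpts from the record: first 30 bp and last 12 bp of DPOGS208752-TA.
CDS_START = "ATGGCTGTAAAAAAATGTTGTGTAGAAAAC"
CDS_END = "ATTATATCTTGA"

print(translate(CDS_START))        # start of DPOGS208752-PA
print(translate(CDS_END))          # end of DPOGS208752-PA, including the stop
print(expected_cds_length(1453))   # compare with the 4362 bp transcript
```

To check the whole record, pass the complete nucleotide sequence (with line breaks removed) to `translate` and compare the result with the protein sequence above.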