Monarch geneset OGS2.0

DPOGS212418
TranscriptDPOGS212418-TA5352 bp
ProteinDPOGS212418-PA1783 aa
Genomic positionDPSCF300258 - 53168-58642
RNAseq coverage436x (Rank: top 28%)
Annotation
HeliconiusHMEL0096160.069.67% 
BombyxBGIBMGA002817-TA0.066.87% 
DrosophilaMESR4-PB5e-12643.12% 
EBI UniRef50UniRef50_E2BEC11e-13445.97%Chromatin modification-related protein YNG2 n=1 Tax=Harpegnathos saltator RepID=E2BEC1_HARSA
NCBI RefSeqXP_001960743.11e-14037.78%GF11347 [Drosophila ananassae]
NCBI nr blastpgi|1947569702e-13937.78%GF11347 [Drosophila ananassae]
NCBI nr blastxgi|3800131850.034.59%PREDICTED: uncharacterized protein LOC100867677 isoform 1 [Apis florea]
Group
Gene OntologyGO:00055153.5e-07protein binding
GO:00082703.5e-07zinc ion binding
GO:00036769.8e-05nucleic acid binding
KEGG pathway 
InterPro domain[1690-1774] IPR0110111.9e-20Zinc finger, FYVE/PHD-type
[1719-1778] IPR0130831.8e-17Zinc finger, RING/FYVE/PHD-type
[1727-1774] IPR0019653.5e-07Zinc finger, PHD-type
Orthology groupMCL19366 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212418-TA
ATGGAAATAAATCCTTTGTATTGTATTCGTACTGGACGTACGATTTCAGAGAAAACTATGGAAGATGGTAATATTACCAATATTCAAGACGATAAAATCGCTGCAGCATCTTTGGCAAATCTCTATAGTGAACGGCCCAGTAAACTAACGACAAACGGACATGATTTCGCTGGAGAGTCGTCTAGTTCACCCGACCAGTCTTTTGGCACGCGATGCCATAGCGCTGACACATTACAAGAAGATAAGTTCAGGAAACGCATTAATCCGCTTAAAATACACTTGGGTAAAAGACCTTTTGTGGATAATTCAGTTTCCTATAGTGTGAATAAAAAGCGCAAGCCGTTCGTTAACAAAACCAGTGACACACCGAAGAATATACCCCCTAGAGTACCTAAGTCAATACCGAAACGCTTGGAAAACAGTCATTTTCAAAATGTTGCATCACACAGTGGACTGGAGAGTGAAAAGGACAAACCTGTGCCTCGTCTCGTGCCTAAGAGTCAGTTATCACAATATAAAAAGGGTAAAGTGGAAAAGGGAGCCCAGTTGAAAGAGTATGCTCAAAGCTTAGGATTACAGCCCATGGTAAAATTTAAATGTCGAAAATGTTGCTCCATGTCATTCAAAACTCTGGCTATGTTAAAAGAACATCAACTGGTGTGTCGGGCGCAGGCTAAGGAACCTGCACTCAGTCCCGAGGCAGCACCCAATACAGAAATGAACAGCAGTGGTCCTTCGAGAGTCACTCGCAAGGTCTATTTGTGTTCCGCATGTGGAACCTACTATGAGAATTGGAATCTCTTCCTCCACATGCGGGAAGTACACAACAGACACATTTGCCTGTTCTGTTTGGGAATGTTTTCCAAAGCTGAGAAACTCTCATCTCACTTAACAAATAAACACTATGTGAAAGAGTTCAATTTCGACAGCAAGGAGGAATTCTCTAAAGTTTACAATGGAACATTTTATTTAATGTGCTGTGCCTGTGATGAAATATGTAGTGAAAAAGATGATTTTTTCAATCATGACTGTAACAATGTGAAGCCTAAACCTGTGCCTGTTCTTATTAGTAAGAATTCAATGAAGACCAATTCCAAAATTTGTAAAAGTACGCATGAAACTTCACACAATGGACCCGGAGTAGTGAATAAAGATACTAAATCTGTACATAGCAAAAAAATTATAAGCGAGCCCAAACCTGACAAACAAACTACACATTTACAGAGACCAATGAGTTTTATGAACAATGATTCTATCAACACCGATACAAGTATTAGGAGCAACAGTTCAATAGCTGTTAGTAAGGAGACCAGTATCCAGTTGGACTCGGCAAATGAGAATAATAACATAACTGAGAATTCAACAATAAATTCCATTGATAATTTATTGAATAATACGACAGAAGCCATTACACCTTCGGTTCCTAATGGCAATGAACCCTCTGAGGAGTCCGAAGATAGCGACACACCTGAATCAGAGAGCCTAGAACCACCAGCACCTGACTCTACATCCTTGAGGCCTAGAGAACCATCTCCTCCACCGGTAGAGAAAGAAAAAGAGGGTGGTATTAAATTGAAATTGAGTCTTAATAATGCTAATTCCCCAGTGATTATTAATTCCACTGTGGATCCCACCGTGGGCCAGCAGTATGCTCACAAGCCCCCTTCACGGAACAGAAGACCTCCTAAAAGATATGAGAAAGATAAGCCTCCCGATCCTGCACCTCCCATAAAGCCTCCCTCAATTAAGATGACTATATGTGATAAAGCTGACAGCTCATTTGTAAAGACGACTATCATAGAAAATAATAAAGAAACTATAATGAAGACGCTTGTAAATAATAATGTGGAGTCCATAAGCAAACCTCCCGAACATAAGACGGACACCGCAATCAAGTCTACCATTTACGATAACAAGCTAGAAGATCTTAATACTTTAAATAGGGTACCAAAGCTAACTGTACGAGTTCCTAAAGAGTTTCTAGACAAAGATTCTTCTAGTGATTATTCTTCGGATAGCGATAGTGGCGAAAAGTGTAACGGCGAGAGCAAAGAGAGTGTGGAAGAGCCATGTCCAGAAATGGAAGAACCGAAATGTGAAGTGACTCCGACACTACCCGACGATGTTAGAAAAGAAGAATCTACGGAAAATGAAAACGCCCATGAGCCTCCAGTAGACGAAACCATCAAAAACGAGGAACAAACCCTGGTTCAAGTTGAGCCGAAAGTAGAGGAAGCGGAGCAGACTGTTCCTGACGAAATTAATGTAGAGGAGCCGGATGAAAGTGACAATGTACCTGTAACAGAATTGACATTAGATAGGCCAATTGATAAATATCCATTAAAGGACCTCCTTAAAGTGTTTCTGGCTTCAACTGTCATAAATTGTATCTATTGTAATCATGCACGTAAGATAGCAGTGAATTGTGAGCAACTGGCACTCCATATGGTGGCCGAGCATCGGTTCTCGGCCACGGTGAACAGTATAACTGCCGAGGAGTTAATGCCAGAAACGATCACGGCCAAAGTTAAAGCTGGGGCCCCGGAGTTGTGTAAAGTTTATATTAATTTAGACTCTTACGACAGTGTTGACAAGTGTGAGACGGTTCAGAACAATCAACTTTTCGAATGTTTTCAGTGTTATTTTAGAACTGCTGTTCATAAAGATCTCTATTTACATAATCGTAAAATGCACCAGAAGACAATTTTACTGTGTGTCATGTGTAAAAATAATTTTTATTCTTATAGTGAGTTGTTATGTCATTTATGCCCTGGAACATATGATTCTGAATATGAAATTAAGTTTAGATGCTGTTTTTGTAACGTAGACAACATTCTGTCGACGTTTAGGTTGATGGTGCATCTTAGGAAGATACACCACACTTGCGATGTGTGTTTAGAATTCTGCCAAAGTCAAGCACGCCTATCTAACCACGTGTGGAAACATAAGCTGCATCATTTGTGCTACCGTTGTGATATAGCCTATAGGAATAAGCCAGACATTACTAATCACTTGTTTTGGAAACATGGCACCGAAAGTGTGTTATGTAAAAGGTGTCTGCAGAAGAAATGGCCGCACGTGTACCATTTCTGCACCCCTCCCGCTGTATTCGTTTGCGACGAATGTACTCTTCAGTTCACTAGAGCTGTTTGTCTCAAAGTCCATAAGAGATTTCATTCAGAAGAATATCCACATGTGTGCATTGAGGAAGGTTGCACGGAAAAGTTTGTATCGAAGAAATTGCTTAATAAGCACTCCGAGGAACATGGAAAGAAATTAGTTAAAGAAGACCTTAAAGATACTAAACCGTTAGAATCAAATGATGCACCACCAGAAGATCAGAAAGACCCTATTCCCGTCATAGATCTTGTTAACGATAAACCAAAAGAGGAAGCTGGATCTTCCGAGGTTAAAGCTGAGGCCGAGACTGAACTGTCTTCTAAGAAAGTCAAAAAGAAGAAGGTGAAAGACAAGGATGCTTTGTTGTTAGATGTCAATTTGCCTGCATTAAATCTGTCTGAAAGTGATAGCGACGACTCGGACAGTAATTCAATCCAACCAACAAAAGAAATTGATTCAGAAGATAAACCTAAACTTGAACCAGTTGAAGATAATGCTGTAAATCCCGTAAATAACGAAAAAGATGCCACTGACGGAAATGTTGAAATAACTTCGAATAAAGTAGAGGAGACACCAGAGTCATTAGAAGATAAGACGAATGAGCAGCAAGTATTAGATATATGGGATAACTTCAAAAAGTATCAGGCTAAGGTGGAGAAGCAAAAGGAAAAAACGCCCCCACTCGTTCCTATCAGAAAACACGTTTGTGAATCGGACCATGACTATTGCGTTATACCTACGGAAGTCAACGGAGACGATGAATCCTTTGATAAGAGAAAAAACAAAAAATCCCCAAAGAAAAAACACGGCGGCCTGTCATCATCTAGTAGTAGCAGTAGCGACAGTGATTCCAGCTGTTCCTGTGGATCGAACTGTAGTTGTTCTTCGAGCAGCGGTTCCTCATCGTCCAGCTCGTCAGATTCCGATTCATCAGACGAATCTGGAAACGAAAAGAAAAAGAACAAACAAATGAAGAAGTCACTTCCAAACAGAAGAATGAGTAACGGTTCTAATGTTGATGTAATGGGTATGTCGGAGACCCCCATACTTGTACCGGAAAAGACTGAGCCAGCCATCGCCGAGAGCGACCTGGAAACGGATGAAAGTGAAACAGATGAAGAATTCTATGACAAAAATCCTCAACAAATTGCCAATAAATTACACAACGAAAAACGAAACCAACTGTTGTTGTTAGCATCAGTCGCTCCGTCTGACGGAGGTTCCGTGTCTGGAGATGTAAGCCGCTGTAACACACCAGTCAAAGAAGAAGAACCAGAGAAGCAGAAAGAAGAGGTAAAAGACGAAGACATTAAGGAAACCGAAGTGAAACAGGAATCTGAGGCAAAAGATAAAAGTAGCAGCGGCAAAAAGAAGAGTAAAAAGAAAAAGAAATCGAGAAGTTCACGCAAACACGGCCCACTTAAGATGATAATTCCCAAGGATGTCATTAGTAAACCAGAAGAAGAGATCCCACCGCCGTTGATTCCGAATAAAATAACTATATCATTACAATCGAATACCGTTCAACTGCCGGAAACTCCCAAAACAGCCTCGATCCCGCCACCCAAGAACTCGAGTACTCCCGCGACCACCATAGAGAGAAAGAGGGCGTCCAAAAGAAGAAGGGTGCCGAACAGGTTCTACGGCTATTCCAGTGACGAAGAGGCACCACAGACTCCTGCTGCATTAAAACCTCAACTGCCTCCCAAATTAGAGTGGCGGAAGGAGGACCTACCATCGCCGGTCACGCACAAACCCAAAAAAGAAATTCCGTCTACCGTAACTCCTCAGAGAATGTTCAACTACACGGAACCCATACGACTGACGGCTCCCATACCAGATCCGGAACCGCCTCGGTTCCTCATGAATTCCGAATCTATGGAGTCCAGCGATTCCGAGTCCAGCACGGAACCAGCGTTAGAGATATTCCAACCGCCGGCTCCGAACCCTCCTCCCGTGACAGTTCCGCCGCCGACGTATCTGAACTCTGGCACTTCCAGCTTGCCGTACGCCTTCCAAAGGCCAGCGGCGCGGCAGGCGCGGGAAGGCGAGAGCGTGTACTGCTACTGCCGCTGCCCCTACGACGAGGTGTCGGAGATGATCGCGTGCGACGCCGAGGGCTGCCCCATCGAATGGTTCCACTTCGAGTGTGTCGGTATCATGGTACCGCCTAAAGGCAAATGGTACTGTCCGGAATGTAGGAAAAATCAAAGCGTCACAGGCTGCAGATAA

Protein sequence:

>DPOGS212418-PA
MEINPLYCIRTGRTISEKTMEDGNITNIQDDKIAAASLANLYSERPSKLTTNGHDFAGESSSSPDQSFGTRCHSADTLQEDKFRKRINPLKIHLGKRPFVDNSVSYSVNKKRKPFVNKTSDTPKNIPPRVPKSIPKRLENSHFQNVASHSGLESEKDKPVPRLVPKSQLSQYKKGKVEKGAQLKEYAQSLGLQPMVKFKCRKCCSMSFKTLAMLKEHQLVCRAQAKEPALSPEAAPNTEMNSSGPSRVTRKVYLCSACGTYYENWNLFLHMREVHNRHICLFCLGMFSKAEKLSSHLTNKHYVKEFNFDSKEEFSKVYNGTFYLMCCACDEICSEKDDFFNHDCNNVKPKPVPVLISKNSMKTNSKICKSTHETSHNGPGVVNKDTKSVHSKKIISEPKPDKQTTHLQRPMSFMNNDSINTDTSIRSNSSIAVSKETSIQLDSANENNNITENSTINSIDNLLNNTTEAITPSVPNGNEPSEESEDSDTPESESLEPPAPDSTSLRPREPSPPPVEKEKEGGIKLKLSLNNANSPVIINSTVDPTVGQQYAHKPPSRNRRPPKRYEKDKPPDPAPPIKPPSIKMTICDKADSSFVKTTIIENNKETIMKTLVNNNVESISKPPEHKTDTAIKSTIYDNKLEDLNTLNRVPKLTVRVPKEFLDKDSSSDYSSDSDSGEKCNGESKESVEEPCPEMEEPKCEVTPTLPDDVRKEESTENENAHEPPVDETIKNEEQTLVQVEPKVEEAEQTVPDEINVEEPDESDNVPVTELTLDRPIDKYPLKDLLKVFLASTVINCIYCNHARKIAVNCEQLALHMVAEHRFSATVNSITAEELMPETITAKVKAGAPELCKVYINLDSYDSVDKCETVQNNQLFECFQCYFRTAVHKDLYLHNRKMHQKTILLCVMCKNNFYSYSELLCHLCPGTYDSEYEIKFRCCFCNVDNILSTFRLMVHLRKIHHTCDVCLEFCQSQARLSNHVWKHKLHHLCYRCDIAYRNKPDITNHLFWKHGTESVLCKRCLQKKWPHVYHFCTPPAVFVCDECTLQFTRAVCLKVHKRFHSEEYPHVCIEEGCTEKFVSKKLLNKHSEEHGKKLVKEDLKDTKPLESNDAPPEDQKDPIPVIDLVNDKPKEEAGSSEVKAEAETELSSKKVKKKKVKDKDALLLDVNLPALNLSESDSDDSDSNSIQPTKEIDSEDKPKLEPVEDNAVNPVNNEKDATDGNVEITSNKVEETPESLEDKTNEQQVLDIWDNFKKYQAKVEKQKEKTPPLVPIRKHVCESDHDYCVIPTEVNGDDESFDKRKNKKSPKKKHGGLSSSSSSSSDSDSSCSCGSNCSCSSSSGSSSSSSSDSDSSDESGNEKKKNKQMKKSLPNRRMSNGSNVDVMGMSETPILVPEKTEPAIAESDLETDESETDEEFYDKNPQQIANKLHNEKRNQLLLLASVAPSDGGSVSGDVSRCNTPVKEEEPEKQKEEVKDEDIKETEVKQESEAKDKSSSGKKKSKKKKKSRSSRKHGPLKMIIPKDVISKPEEEIPPPLIPNKITISLQSNTVQLPETPKTASIPPPKNSSTPATTIERKRASKRRRVPNRFYGYSSDEEAPQTPAALKPQLPPKLEWRKEDLPSPVTHKPKKEIPSTVTPQRMFNYTEPIRLTAPIPDPEPPRFLMNSESMESSDSESSTEPALEIFQPPAPNPPPVTVPPPTYLNSGTSSLPYAFQRPAARQAREGESVYCYCRCPYDEVSEMIACDAEGCPIEWFHFECVGIMVPPKGKWYCPECRKNQSVTGCR-