Monarch geneset OGS2.0

DPOGS205368
TranscriptDPOGS205368-TA3486 bp
ProteinDPOGS205368-PA1161 aa
Genomic positionDPSCF300373 - 119068-124679
RNAseq coverage100x (Rank: top 61%)
Annotation
HeliconiusHMEL0134450.064.19% 
BombyxBGIBMGA008761-TA0.048.87% 
Drosophilacrol-PE2e-3428.12% 
EBI UniRef50UniRef50_UPI00016E15C21e-5224.32%UPI00016E15C2 related cluster n=14 Tax=Takifugu rubripes RepID=UPI00016E15C2
NCBI RefSeqXP_001945749.12e-4622.87%PREDICTED: similar to mCG7830 [Acyrthosiphon pisum]
NCBI nr blastpgi|3266671109e-6324.31%PREDICTED: zinc finger protein 729-like, partial [Danio rerio]
NCBI nr blastxgi|3266671102e-8224.12%PREDICTED: zinc finger protein 729-like, partial [Danio rerio]
Group
Gene OntologyGO:00036766.2e-11nucleic acid binding
GO:00082704.1e-05zinc ion binding
GO:00056224.1e-05intracellular
KEGG pathway 
InterPro domain[1069-1103] IPR0130876.2e-11Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL18340 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205368-TA
ATGGAGCAAGAGCACACTAGTTTCCATTTGGATTTAACGTTTAGGCATTTGACGAAAGCGGAATTCGTCAAGGTAGACATATCGACCATACACTGTAGGATATGCGCGAGAGCTTTCGAGAGTTTGGAACTTGTTGCCGCCCATCTCAAGCTGGACCACAGGATAGCGTTGCACATGGAATCGAAGATCGGTGTTATACCTTTCGAGCTGAAGGGTGACAAGGAATGGATTTGTTGCGTTTGCTCCAAAAATGTACCTTCGCTATTGCTGCTAAGCAAACACACGGTGTCGCACTTTCAGAATTTCGTATGTGATCTTTGCGGGAAGAGCTACATTTCTGCCACCGGTCTATTAAAACACGTGAAATCCAAGCATCATGACGAATATAAGGTACATTGTAGACGATGTCGAACCACATTCTCGTCGCTGAAGGAAAAGATCTGTCACCAGAGGACTGTTAAGCATTGCATGCCGCATGTTTGCATGGAATGTCCAGAAAGATTTCCCACTTGGGAGACGAAACAGCGCCACTTGGTCCAAGTTCACGGTAGAAGCAAGAAGTCCTACGCCTGCAGCCACTGTGACGTCAAATACTCTGATCGTCGATCGTTCTACGATCACTATAAAAAATATCACACTAAAGATTGCCTTATCTGTATACACTGTGGGTTGAAATTCTCTTGCACCTCTAGGCTCAACCAAGTGGAATATGTAGTCATCGCTCGGAACAACGCTAAACTAGTTTTGGAAAACTCAACCATGTACCCGTTTAGGATACCAACTGTTTCACTTGTTTGCGTCTTTTGCTGCGAATCATATGATGATCCCAGGGAGTATAGGAATCATATAGATGAAGAGCATCCGTCGTTTAATGTTGCGAAAGCATTCGCCCACTCTGTCAACAATTATACAGAGTTCCTGAAGGTAGACTGCGTGAATCTGTCGTGTCGTATTTGCTATCAGCCTTTTTCAAATATAGATTCGATTGCGAAACATTTACGGGATGATCACGGAATTGACATCGATCCGAACGTTGAAGTTGGGATGCAGATGTTCAAACTCGGTCCAGAGCTGTGGGTTTGCGCTTTATGTGACACGAGGTTGCCGACCTTACGAAAGCTGAGTCGCCATACGAACAGCCATTATCACAAATACACGTGCGAAACCTGCGGGAAATCTTATATTAATAAAGAGAATTTACATAAGCACATAAAGTATGCACACAGTGAAGAGAAGATATGCAAATGCAACAAGCTTTTTAAAACTACTGAAGAACGAAAAGCTCACTTCATTGCATCTGAGAAGTGTTGGCGCTACAGCTGCAACCTTTGCTTCAAACGTTTTATGTCACATAAGGCTAAAAACGACCATTTAATTGAGGATCATGGGCATACACTCAAGTCGTACTCTTGCTCTGAATGCGATAAGATCTTCGATAAATGGCACACGAGCTTATTTCTGTTTGTTCTAGAAGTGCATTGCTTAGTCTCAGCCCGAAACAACGCGAAGGTAGTTTTGAAATACTCCACAGCGTATCCGTTTAGGATACCGAGTCATTCAATGGTTTGCGTCTACTGCTGTGAAGCATATGACGATCCAAAGATATACCGACAGCATATGGACCAGGAGCACGCTTCATTCAACGTGAACACAGCGTTCGCGCACTCTGGACACAACTGTACAGAGTTTCTCAAAGTAGATTGCACGGAATTATCATGTAGGATATGTGAGCAACCCTTCAACACGATCGATGCTGTAGCTGAACATTTATCTAAGATACATGAAAAGAACGTGGATTTGAAAAACGATGTCGGAATGCAGATGTTTAAACTGGGTGCGGAAAGATGGGTCTGTGCGATTTGTAATATAAAACTGCCCAGCCTTCGGGAGCTGAGTCGACACACGAGCTGTCATTATCACAGATATACTTGCGAGACTTGCGGTAAATCGTATATAAATAAAGAAAACCTTCAACGTCACATACAGTTCGTGCATAGCAAATATAAAATTTGCATTAAATGTAAGAAGACGTTCCCGACTTGTGAAGAACGGAGGGAGCATGTATTATCCTCAGAGCGGTGTTGGCCACAGAGTTGTAGCGTATGTGGCAAGAGATTTGTGTCTAGAAATTTAAAGCTTGATCATTTGGCCAAAGAACACGGCCAAAAGCCCAAGTCCTATACATGTCCGGAATGTGGAATAGTCTTCGACAAATGGCATCCGTATCGCGCCCATTTCATATTGGCTCACACGAATGATAACTATAGCTGTTCACATTGCGGTCTTAAGTTTGATAGAAAGAAATCCCTGGAGGAGCATAAAGTAATGCACACAAAAGAAAAATCTTTTCATTGCGGCGATTGCGGGAAGTCATTCGCGCACAAAAATGATGCCCATTTCATGGCGAGGCGCAACGCAGAGTTTATCGTGAGATTTTCAACAGCGTACCCGTTCAGGCTTCCAGAGGATTCCATGGTTTGTGTGTATTGCTGCGATAGCTATAGCGATCCCGCTATGTATAGGCGTCATATGGAAGAGGAGCATCAGAATTTCAATGTCCGAATGGCCTTCGTTCATTGTAGCGAAGGCTATATCAAAGTGGATTGCACGGAACTTCGCTGTAGACTCTGCTCTGAACCATTCGATGCGCTAGAAGATGTCGCCCAGCACTTGTTTCATAAACATGAACAGCCTTTAAATCTCTCGTTTGAACTTGGCATGCAACCATTTAAATTGGAAAAGGATAAACTGATTTGCGCAATATGCCGTGCAAAATCCCTATGCCTTAGACAGTTAAGCAGACATACGCAGACTCACTTCTTAAAATATACATGTGAGGCGTGCGGAAAATCTTATGCCACAATGACTCCTTTGAAGCATCACATAACTTACTCGCATACGGGTCAGGAAAGGATATGTAGGAAATGTAAAAAGACATTTAGTTCCTTAACTGAGAAACGCCAGCACTTGCAGGACTCAAAGTCCTGTTGGTCGCATTTGTGCAACATTTGTGGCGAAAGATTTCTCAGTTGGACCATAAAACAAGCACATTTGACAGAAGTACACGGCGCTCCAAAGAGGACTTATGTCTGCCCCGAATGCTTGGAGGTTTTCCCAGATAGAAAAAAATTTCGTGTCCATTTTAAAATTTTACACACAGATGACAATTTTGTTTGTACTTGTTGCGGTCTTAAGTTCGATACTAAAAGAAATTTAGAAAATCACAGAGTCGTCCACACAAAAGAGAAATTATTCCCTTGTCCAGTTTGTTCGAAATCATTTCCCAGAAAGAAGAATTTAGTTCAACATATGTGGATTCACAGCGAGCTGAAGAGATTCAGCTGTACTTTGTGCAATAAACAGTTCAATCAGAGAGTCAGCTGGAAAACACATATGAAATATTACCATCCAGATCTAGTTAATTACGACGGAATGCAGAACAATAATGCGAAAATGGTGCTAACTGTTCTAAGAAATGATGAATAA

Protein sequence:

>DPOGS205368-PA
MEQEHTSFHLDLTFRHLTKAEFVKVDISTIHCRICARAFESLELVAAHLKLDHRIALHMESKIGVIPFELKGDKEWICCVCSKNVPSLLLLSKHTVSHFQNFVCDLCGKSYISATGLLKHVKSKHHDEYKVHCRRCRTTFSSLKEKICHQRTVKHCMPHVCMECPERFPTWETKQRHLVQVHGRSKKSYACSHCDVKYSDRRSFYDHYKKYHTKDCLICIHCGLKFSCTSRLNQVEYVVIARNNAKLVLENSTMYPFRIPTVSLVCVFCCESYDDPREYRNHIDEEHPSFNVAKAFAHSVNNYTEFLKVDCVNLSCRICYQPFSNIDSIAKHLRDDHGIDIDPNVEVGMQMFKLGPELWVCALCDTRLPTLRKLSRHTNSHYHKYTCETCGKSYINKENLHKHIKYAHSEEKICKCNKLFKTTEERKAHFIASEKCWRYSCNLCFKRFMSHKAKNDHLIEDHGHTLKSYSCSECDKIFDKWHTSLFLFVLEVHCLVSARNNAKVVLKYSTAYPFRIPSHSMVCVYCCEAYDDPKIYRQHMDQEHASFNVNTAFAHSGHNCTEFLKVDCTELSCRICEQPFNTIDAVAEHLSKIHEKNVDLKNDVGMQMFKLGAERWVCAICNIKLPSLRELSRHTSCHYHRYTCETCGKSYINKENLQRHIQFVHSKYKICIKCKKTFPTCEERREHVLSSERCWPQSCSVCGKRFVSRNLKLDHLAKEHGQKPKSYTCPECGIVFDKWHPYRAHFILAHTNDNYSCSHCGLKFDRKKSLEEHKVMHTKEKSFHCGDCGKSFAHKNDAHFMARRNAEFIVRFSTAYPFRLPEDSMVCVYCCDSYSDPAMYRRHMEEEHQNFNVRMAFVHCSEGYIKVDCTELRCRLCSEPFDALEDVAQHLFHKHEQPLNLSFELGMQPFKLEKDKLICAICRAKSLCLRQLSRHTQTHFLKYTCEACGKSYATMTPLKHHITYSHTGQERICRKCKKTFSSLTEKRQHLQDSKSCWSHLCNICGERFLSWTIKQAHLTEVHGAPKRTYVCPECLEVFPDRKKFRVHFKILHTDDNFVCTCCGLKFDTKRNLENHRVVHTKEKLFPCPVCSKSFPRKKNLVQHMWIHSELKRFSCTLCNKQFNQRVSWKTHMKYYHPDLVNYDGMQNNNAKMVLTVLRNDE-