Monarch geneset OGS2.0

DPOGS202167
Transcript: DPOGS202167-TA; 4890 bp
Protein: DPOGS202167-PA; 1629 aa
Genomic position: DPSCF300162 + 67466-73431
RNAseq coverage: 451x (Rank: top 27%)
Annotation
Heliconius: HMEL003700; 0.0; 71.01%
Bombyx: BGIBMGA003310-TA; 0.0; 56.49%
Drosophila: crol-PE; 2e-34; 25.37%
EBI UniRef50: UniRef50_F1QEE5; 7e-46; 22.81%; Uncharacterized protein (Fragment) n=2 Tax=Danio rerio RepID=F1QEE5_DANRE
NCBI RefSeq: XP_001945749.1; 3e-52; 22.44%; PREDICTED: similar to mCG7830 [Acyrthosiphon pisum]
NCBI nr blastp: gi|326667465; 2e-55; 22.90%; PREDICTED: hypothetical protein LOC571721 [Danio rerio]
NCBI nr blastx: gi|326667465; 2e-81; 22.26%; PREDICTED: hypothetical protein LOC571721 [Danio rerio]
Group:
Gene Ontology: GO:0005634; 3.2e-16; nucleus
Gene Ontology: GO:0008270; 3.2e-16; zinc ion binding
KEGG pathway:
InterPro domain: [8-78] IPR012934; 3.2e-16; Zinc finger, AD-type
Orthology group: MCL26517; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202167-TA
ATGGCACTCAAGCTAGGAAAATGCAGATTGTGCCTGAAACTGGGTGACTTCTATTCGATCTTCACTATGGATAACAATTTGCAGTTAGCGGAGATGGTAATGGAATGTGCTCGAGTTAAAATATATGAAGGTGACGGACTTCCGGACAAAATTTGTAGTGAATGTATACAGAAACTCAGCAGCGCTCACATTTTTAAACAGCAATGTGAGAGATCAGATCAAGAGTTGAGACGCAATTATATTCCACCACCAGCTTTCAGTTCTACACCTCCACCGCCAAATAGACAAAGCAGTGACTCCGCAATTTCAACTCACCTGGAAGTTTCAAAGCCTTCGTCTTCAAATGATAGTAAAATGTCGATCGAGAGCAAACTCACTCCCATCAGTCGTAACAGAAAACGCAGTAAGGATAGTATGGGTGACGCATCAACCAGTAGTGACTGTAGACCGGGCAGTTCCAAAAGAGTCGAAGAATTAAGAAGGACACGCAAGAAGCCAAAAATGCTCTCGAATTACGATTCCGACTTCGATGACAGTGGCTCATTTTACTCTCAGGAAACCGATTCAGACGATCCGCTACTGTACAAATGTGATATATGTTCCAAAGCATTCAAGTCAAAGAACAGTCTCTCGGGACATTCTAAATGTCATAAAAGAAAAAATGCATTAAAATATGACTCGGTGACCAAAGAGGATTCAATGTTGAATGCATATGTGCCAGAATTATCGAATGTGAGAGACGCTCACGACGATGACGATAAACTGAAATGCGAAAAATGCGGGAAAGAATTCAAATTGAAGATAATGCTCAAAAGACACAACGAGATCTGTAGCAGACCACCGATGAAAGAGCTTTTGATATCTCTAGAGCCGATCAACATTACACGTAAAAGAAACAGACTGGATTGCGAGCTGTGTTCCTTGAAATCTGGAACTGTGGAGGGCTTGCAGGAGCACATGAAGCTCGAACACGCCATGGAACTCGACAAGGACAAGGTGTGCATGAGGGATCGCGACGGAAAAATATGTGTTCCGTGCTGCTACTGTGAAGAGAATATCGACGACTTCTATAAGTACACCGCTCATATCGGCGAATGTACCAAGAAGGGCAATGCCGCGGACATCGTGTGCCCCGTGTGCAAACAGACGACGACGAAATCCAATTACTTGGTACACGTTAAGCTGCATTTCTTTCCGACTCGGACGATTGAATCTGGTGCTACTAAAGAAAATTTCCAGTGCAGAATGTGCAACAAAGAGCTGCCGAGCCAGGAGTTACTGATCAAACACCTGGCTGCTCACATGTCCAATATAGATGACGCCGATGAGGGGGGCGACGAGGAATCTCGAGCGAGTACAGTTGAAGACTGTGGGTCGATACATTCTGAATACAGCATAAATACTCCAAAAACAACCTTGCAGTGTCAACATTGTGATAAGACGTTTAAATATAAAAAAGCCTTACAATCTCACGAGGAGAAGCATAGGCGTGAAGTGAAAATAGAAGGTCCCGAAACACATCAGTCAGCAGATAGCATCAACGTAGTGGACCCATCATTCGCTCAGTACGACTCTGACACTAGTCAAGAAGACGGCGAAGACGATAACACGTGTGATATTTGTGAAAAACAATTTTCCTACAAGAGACAGCTGTTGCAGCACAAGAGAACCAAGCACCATATGACGTCCGGCACCAAGAGGGCGAAGATTAACCTGAAGGACTGTTCGGTCCGATGCTTGATATGCGATATAGAGATGAAGGTGAGCGCGATCAACGAGCACAACCAGACGCACATCTCAGTGAACATCAAGCCCAAGAACCAGTACACGTGTATACAATGCACTGAACAGTTCAAGAGCTGCAGCAATCTGGCCAATCACATCAAGCTGATTCACAGACTGAAACAGCAGCCGATGGATTCGAAAATGAGAGCCGATTTGGCGGATTTTTGTGAAGTCGTTGTGACCAAGGCGGAACCCCTGGACGAGCTCCAGAATCA
CAACGGCGTCAATGAAAATTCCGCCACCGATGTTAAACCTTTAGTCAACATGAGCGGATTCAGCTGTCCCACTTGCAACAAAACTCTGCCCACTCTGATATCACTTAAGCGGCATATCAACTGGCACAATAATGTTGGTAAGAACATGGAAAAGAAATTGGAATGCTTTGTATGTAAAGAGACCTTCCGATTCCAATGTCATTACAAACTGCACATGCGCGATCACTACAAGGACACGAATCTAGACCCGGCCCTACTGACCTGCAACATCTGCAACAGGAAAAGCAAGCACCTCCGGGCCGCTCAGGCACACATGAATTTCCATAAACAGACTCGCTTCCAGAGCAAGGATTACGAATGTTCGATATGCAAGAGAGTGTTCCAGCATCGGAAGGTGTACCTCTCGCATATGGCGATACACTACAAACGCGGCGAGAGCACCAGCAACACTGTAGTCGGAGCCGAGTTGCCCAATACGGTGGATAAAAACGTCTTTGACGGAACCTACAGCTGCCACCTCTGCGGGAAGGTCTGCGATTCGGAAACCTCGTTGAAACACCACGTGATCTGGCACAGCTCGAAGACGTCCCTGTACGGCGCTCGCCATCAGTGCGATATCTGCAATTTGCAGTTCACTAACAAGAAACGTCTCGAGCTCCATACTAGATCGCATTTCGAAGACGACAACGGACCTTTCAAGTGCCACATCTGTGGGAAAGGATATCTAGTCGAAGATTACTTCAAGAGACACGTGAAGGGGCATAACTTCGATCATCAGTCGCATAAAAAGAGGATAGAGAGGCTCAGGAAAGACAAAGTGAAATGTCCGATTTGCTCGCGATACTATCCGGACCTGGGGAAACTGATCCGGCACCTGCGGCGCACTCACCCGGAGAGCAAAATGATCAAACAGGACCCAGACGCCCCAACGCCTCGCTATTATTCTTGCAAGCTATGTGCGAAGGTCTTCTTGGACGAGCGGAGGTTGCAATACCACGAGGAAGCCCATCTCAGAAAACCAGAGTTTTTCAAATGCAAGTTCTGTGGAAAGAAAACAATCTCCCTGAAAAATCATAGGGTTCACATAAAGGGTCACTTGACACAGAAGTACATCGATAATCCTCTGAAATGTAGCCACTGCGAAGAAACATTTACACGCGGCTACGACCTCCAATACCACCTTCGAGACGCTCACGGCGTCAACGAGACGTGGATAGCGGAACGCGGCGTGCAGACTCCCGACGGACCGCTCAAGGAGTTCCAATGCTCCATATGCTTTAAAATATTGGCCAGTAAAGGAAACTTCGAACGACACATCGACTATCACAATTCGCTCCGATGCAATTACTGTTTCGAGTACTTCGGTAGTTCCAGGTTTCTGGAGGGGCATCTCACCTTCAGCTGCGATAAGAAGAAACTCCTCGGCGACACCGAGATCTACCCCAAGAAGGTCAAGTGCCATATATGTTACAAGGCTTTCCATTTGCAAGTCAAGTTAGACTGTCATTTGCGAACCCAGCACGACATAAGAACGTTCAAAGAGGCGTTCGAAGGGAAAAAGGAAATCGTATGCGATTACTGTTTCAAAGTGTTCGAAAACGAATACGCTCTCAGCACGCACAAGATCTACCACCGCACTGTCGGGTACTACGGCTGTATCTACTGCAACAGGAAATTCAATACCATGACCCTGTACAGGAAACATAAGAATCACCACTTCTCCCAACTCAATGTGGACAACCCGACCAAGTGTGAACACTGCGATGAAACTTTCGTGGCCTTCAGGAAGATGATCTACCATATGAGAGACGTCCACGGCGACCACAAGGAGTGGATCGTGTTGCCAAAGGAATCCAAACAAGAGAAATGCAACATTTGTAACAAAACGTTCTTCAACCTTCATAGACATCTGGATTATCACGAGGAGAACAAGTGTCAGAAGTGCGGGGAGTACTTCTACTCGCGGGCGGACTTCGACAATCATCTCTGTGCTATAGACA
GCGAGGAGGAAGTCGCCGACACTAACACTACCGGCGATCGCTGCCAGTACGAGGAGTGCGAGTTCTGCTTTAAACCAGTCACAAAGAAAAACTCAAAGAAAATGCATCTCCAAATCCATAGAGGCTCCGGTTCTATATCGTGTCGATTCTGCGACCTCAAGTTCAAGACGATGGACGCGTTCAATATACACGCGTTTTCGCACAGGAGCAGGAAATACAAAAAGAGACCCATCAAGTGTAGAAAGTGCGGTGAGCAGTTCGTCAAGTACGGCCCCTTCATCCGACACATGAAGTTTGTTCACAAATCACTCAAGAAGCTGCACTACAGAGCCACCGTGATGCCAGAGCAGTGCGTGGTCTGCAAGCAAGACTTCCCCAACCTGCACAACCACTATCGAGCTCATCTACAGAACCAGTGCCATCTGTGTCTCAAGTACTTCACATCTTCAAAGTTATTTTCGTTGCATCAATGCGACAAGGAGGAGTCTGATCCGACCAAAGTGTTCACATCCGACGCCAACTTGACGGAGCTGATCAACTCCTATGTGCCGAGAGACGAAAAAGACGACGAGAAATATTACGGATACGAAGACGAAGGCGAGAACTTGGACGAGAAAGCGAATGAGAAAACGGAAGTGACGTCAAACGTGCCATCGCAGGACGAGGACAGTCAGGGCTCTCTAAATGTAGAGGAAAAGAAAGTACACTCGTTGGTGCACGCGCCCATTATATCAGACGTTCTGTCGCTGTATAAAAATAAATGTAGCAAAAACAGCATCCGGACTAAAGGTGACCAGAACAGCGTCGGTGGGAGCGTTGTGGTGCTCACGGACGAAGAGTCCGCGGACTACGAGTCTAACGAAGCCTCCGTCATCACAATAGACGACTAG

Protein sequence:

>DPOGS202167-PA
MALKLGKCRLCLKLGDFYSIFTMDNNLQLAEMVMECARVKIYEGDGLPDKICSECIQKLSSAHIFKQQCERSDQELRRNYIPPPAFSSTPPPPNRQSSDSAISTHLEVSKPSSSNDSKMSIESKLTPISRNRKRSKDSMGDASTSSDCRPGSSKRVEELRRTRKKPKMLSNYDSDFDDSGSFYSQETDSDDPLLYKCDICSKAFKSKNSLSGHSKCHKRKNALKYDSVTKEDSMLNAYVPELSNVRDAHDDDDKLKCEKCGKEFKLKIMLKRHNEICSRPPMKELLISLEPINITRKRNRLDCELCSLKSGTVEGLQEHMKLEHAMELDKDKVCMRDRDGKICVPCCYCEENIDDFYKYTAHIGECTKKGNAADIVCPVCKQTTTKSNYLVHVKLHFFPTRTIESGATKENFQCRMCNKELPSQELLIKHLAAHMSNIDDADEGGDEESRASTVEDCGSIHSEYSINTPKTTLQCQHCDKTFKYKKALQSHEEKHRREVKIEGPETHQSADSINVVDPSFAQYDSDTSQEDGEDDNTCDICEKQFSYKRQLLQHKRTKHHMTSGTKRAKINLKDCSVRCLICDIEMKVSAINEHNQTHISVNIKPKNQYTCIQCTEQFKSCSNLANHIKLIHRLKQQPMDSKMRADLADFCEVVVTKAEPLDELQNHNGVNENSATDVKPLVNMSGFSCPTCNKTLPTLISLKRHINWHNNVGKNMEKKLECFVCKETFRFQCHYKLHMRDHYKDTNLDPALLTCNICNRKSKHLRAAQAHMNFHKQTRFQSKDYECSICKRVFQHRKVYLSHMAIHYKRGESTSNTVVGAELPNTVDKNVFDGTYSCHLCGKVCDSETSLKHHVIWHSSKTSLYGARHQCDICNLQFTNKKRLELHTRSHFEDDNGPFKCHICGKGYLVEDYFKRHVKGHNFDHQSHKKRIERLRKDKVKCPICSRYYPDLGKLIRHLRRTHPESKMIKQDPDAPTPRYYSCKLCAKVFLDERRLQYHEEAHLRKPEFFKCKFCGKKTISLKNHRVHIKGHLTQKYIDNPLKCSHCEETFTRGYDLQYHLRDAHGVNETWIAERGVQTPDGPLKEFQCSICFKILASKGNFERHIDYHNSLRCNYCFEYFGSSRFLEGHLTFSCDKKKLLGDTEIYPKKVKCHICYKAFHLQVKLDCHLRTQHDIRTFKEAFEGKKEIVCDYCFKVFENEYALSTHKIYHRTVGYYGCIYCNRKFNTMTLYRKHKNHHFSQLNVDNPTKCEHCDETFVAFRKMIYHMRDVHGDHKEWIVLPKESKQEKCNICNKTFFNLHRHLDYHEENKCQKCGEYFYSRADFDNHLCAIDSEEEVADTNTTGDRCQYEECEFCFKPVTKKNSKKMHLQIHRGSGSISCRFCDLKFKTMDAFNIHAFSHRSRKYKKRPIKCRKCGEQFVKYGPFIRHMKFVHKSLKKLHYRATVMPEQCVVCKQDFPNLHNHYRAHLQNQCHLCLKYFTSSKLFSLHQCDKEESDPTKVFTSDANLTELINSYVPRDEKDDEKYYGYEDEGENLDEKANEKTEVTSNVPSQDEDSQGSLNVEEKKVHSLVHAPIISDVLSLYKNKCSKNSIRTKGDQNSVGGSVVVLTDEESADYESNEASVITIDD-
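
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the geneset. It assumes that the transcript is coding sequence only (start codon through stop codon) and that the genomic coordinates are 1-based and inclusive. The helper names `cds_length` and `span_length` are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 4890
PROTEIN_AA = 1629
GENOMIC_START, GENOMIC_END = 67466, 73431


def cds_length(protein_aa: int, with_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if with_stop else 0))


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Does the transcript length match the protein length plus one stop codon?
print(cds_length(PROTEIN_AA) == TRANSCRIPT_BP)

# Genomic bases within the gene span that are not part of the transcript.
print(span_length(GENOMIC_START, GENOMIC_END) - TRANSCRIPT_BP)
```

Under these assumptions the first check holds (3 x 1630 = 4890), and the genomic span is 1076 bp longer than the transcript, which would be consistent with intronic sequence inside the gene model.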