Monarch geneset OGS2.0

DPOGS212514
TranscriptDPOGS212514-TA4941 bp
ProteinDPOGS212514-PA1646 aa
Genomic positionDPSCF300222 + 438780-458614
RNAseq coverage21x (Rank: top 79%)
Annotation
HeliconiusHMEL0093295e-10779.83% 
BombyxBGIBMGA009783-TA9e-13546.19% 
DrosophilaCG5245-PA7e-2926.26% 
EBI UniRef50UniRef50_E7F8Z97e-6725.39%Uncharacterized protein n=61 Tax=Danio rerio RepID=E7F8Z9_DANRE
NCBI RefSeqXP_001945749.12e-6225.76%PREDICTED: similar to mCG7830 [Acyrthosiphon pisum]
NCBI nr blastpgi|3266671106e-6925.00%PREDICTED: zinc finger protein 729-like, partial [Danio rerio]
NCBI nr blastxgi|3343263862e-7924.66%PREDICTED: zinc finger protein 850-like [Monodelphis domestica]
Group
Gene OntologyGO:00036767.8e-12nucleic acid binding
KEGG pathway 
InterPro domain[1028-1051] IPR0130877.8e-12Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL22223 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212514-TA
ATGGAACCGCAATCACCTTTGATTCTCTGCATGGAAAATGTTGAAGGTCAGGAAGTGGCCCTAAAAATAGAGCCCAGTGACACATTTCAGATCTTTCTTGATAATTCTAAGTTACTACTAGGTTTTGAAGTTGATATAAACTCGATAACTGGCAACCAGCCGGTGTCACTTAGTGATAACATCTATACATTCCTACTTAACGCTGAAAAGAACGTAGCTGGTAATGACCTTGATCAGATTCTGGATCAAAACCCTGAGAGTGACGATCTGGTGTATGTTCTCGATGATGGTACCCAAATAAGGGCATCACAGATACAATTCGACAACGAAGATCCTTTAATAGATCTGACTGCTGAAAAAATACCATTCGTGAAATACAACGACGATGGTGATGAAGATGTTGCGGAAGTAGGGACTGTTAAAGATGTCAATATCATCGAGAGTCCGGTTACTAGGTGGAGCAAGGACTGCAGTCCCAAATGCAGTTTCGCCAACAGTCTGCCTTTCAAACTCGTATGCAATAACACTTCAAACTTCGACGCGCAGTTCTCGAAATACCTGGAGCCTACCAAAACATACACGACCTTGAATCCCGCCAATCGAAATAAATCACCGAGAAGCCTAATACGAGATAATTATAAGAATTACGATGAACGGAACGATTTCTTCACTAGAGAGGATATTTTGAACATGTTCAAAGATTCTCCGGTGACTTCCTTGCCATGTGATGGGCAGAACTATGAGAAGCGTAGACACGTGAGGAAAACTGACCCTTCCAGATCAGTACACAAGAGCTGGAATTACAAAACAACGCCGGACGTGGACAACATGAGCAATAACCAAGACTGCTTCATATGCGGAAAGATTGTGGAAAATAATGAGAAGCTATACCTATTCGACAAAGAAGATCAAATGTTGCATAGATGTGAGCAAAGGAAATTCCAGCAGCAATTGAAAATCATCTGCGAGAGATGTCTGAATGAGAATTTTAAACCTAGCAGAATGAAGAGTCCGAGCCAGTCGCTCAACAACGACGAATATCTGGTGATAAAGAACAACCAACAGTACATATTCCAAAAAATAACGACCATAGACTTGAAGAAGTTGAACGTGGAGTCAAATAAGAATTCAGAGTACGTCAAAGTCGAGATAGGTCCCGATGGGGAAATTATCACGAAGCCTATAGACAACGACGTAAATGACGTCATGGTGGTTAAAGATGAGAAAAAAGAAAGTTCTAGCGACGTTGAAATTATAGAACCGGAAGTGGAGATTGACATAGACAATATAGAGGAAGCGGATGAACAAGTGAAGGAGTTTTTGGGGAAGTACCAGTGTGATGCCACAGACAATAAAGAATTGAAATGCAGATTTTGCGAGCGCGTTTTCACAGAACTGGACCAGGTCATAGAGCATGGAGAGGAGCATAGACACGACGTAGAAGATGAAACAGTATTCCCATGCCCGTTATGTGACTACGGATACGCGAACTTCAAATATCTCAAAGCTCATCTAAAAGCGGCGCATATTAATAATAAAGACAGTAACAGTGAAGAGCACGACGATAAAAATACACCCAAATCATCTCCAGTAGCCAAAAGGACAAGAAGCGCTGTCAAAAAACACGACAACGAAAACGATGATAAAAATGAAAAGAACTCGGAAGAGACGAAGGCTAATAATACGCAGTTTACTACCGAGGTTAAGCAAGAGTGTGTTGAGAGTAGTGACGAGTCTATATGGATCGTACAAACTGGTGATGATGATGAACAGCTAAACAACTTGCTGCAGGTAAAAGACGACGAGCGTGATATGAACGATAGGAAGAAGCACAAGTGTTTCAACTGCAGAGCACATCACATCAGGCAATTCCACGACACAACCAATCTATTGACGCCCGACGGTCGCTACGTTTGTTCTGAGTGCGAGAACGCTGAATTCGCTACCGAGACAGAGCTATTTGATCATGTCCATTTCCAACACGACAAGCAGAAAAGATGGCAATGCCCCGTCAAGGATTGCGGAAAGACATTCTTCTTGAGGGCAACACTAACAAAACATAGCCGCACACACACAGACACAAGGAGGTATGTCTGTGTGACGTGCGGTAAACGTTTCTTAGACAAGCAGACATTGGACGAGCACGGCGTTACCCATTTACAGATAAAGCCATTCCAATGTCACATATGTCTTAAGCAATTAACCCGTCGGTCGCGGCTTCGGATGCACTTACGTGCCCACGAGGAAGAATTGTCACCAGCTCTAGTTTTGGTTTGCGCGATATGTAATAGGGCGTTTAGGGATCACGTCGATGCACAGGAGCACGCTACGAAATCATCAGAATGTATCGAAGAGTTCACCAATGAACTGAAGGAGGAGAAGGAGGCTTCAGAACAATTGTCTCCTACCAGTGGTCTAGTGAGACATACTGTCCATGTTGTGGCGGAATATTCACACAACAAGGCGATCGATCGCGAAGTGAACAACGCCGAGAGCGAGGCACTACTGTCGCCTTTGGACGACGTCGCAAAAAATATCATACGGGTCGTTAATATTGAGAAGGCGTTCAGATGCGAGTACTGCGAAGATGTATTCTATTTGGAAGACGCTTTAAACAATCACAGGGTTATTCACAAGGGTGTGCAGAATCCGTTTACCTGTCACATTTGTAAAGTCAGTTTCGCCACATATTCAAGTTGGAGTGTTGCGTATCACAGGAAGAAAGTTCACGGAAAATCCGGCCAAGACGAGAATGGGGGTATTACTAAAATTATAAGAGACAACAGGATCCCGTGTCGTGATTGCGACCAGACGCTGCCAAACAAAATTGAACTCTACAAGCATAGAAAGAAGGAGCACTGCGACGAAGCTTTGGATATGGACAAAGAAGGTAACTGGTCCGACCAAATAGACGACGCCAGCACCACCGTCTGCAGCAAGTGCGGGCACAACCTGCACAACGTGACGGCGCTGCAGAAACACGTCAAGGAGGTCCACGGTTACGCCAACCCCCACTCGTGTCCCGTGTGCGGGCGCAGTTTCCGTTCAGCTTCCATACGTAACGAGCACGTCAGGACTCACACGGGGGAACGCCCGTACCCCTGTGATGTGTGCGGGGTAGCCTTTAGACGATCAACGGCGATGCGTAATCACCGTCTCATTCACACGGGGGTCCGAGCGTGGGCTTGTCTGAGATGCCCAAAGAGATTCCGCATCAAATCAGACCTCAAGACCCACTTGAGACTGAAGCACCCGGCCCATTTGGCTGTTATTGAGGTCGAAGGTACCACGGCGTCCTCGGAGGACGTGCAGCAGCACCTGACCTTGAACAACATCAACCAGGACAAGGTCATTGAGATAACTATGATAACGTTCGCTAAGGTGGCGGAATATTCACACAACAAGGCGATCGATCGCGAAGTGAACAACGCCGAGAGCGAGGCACTACTGTCGCCTTTGGACGACGTCGCAAAAAATATAATACGGGTCGTTAATATCGAGAAGGCGTTCAGATGCGAGTACTGCGAAGATGTATTCTATTTGGAAGACGCTTTAAACAATCACAGGGTTATTCACAAGGGTGTGCAGAATCCGTTTACCTGTCACATTTGTAAAATGCACAACACACAAAACACACACGGTTTTTATAAGCGCAGTACGACAGAGGGTAACGAGGGTCACACTGGGCCCGCTAGCGCGGGGATTTTAGGGTACGGGGGATTTCCCGTCGTTAAGCACTTTCTATGCGAGGACTGCGGACGGTCGTATCTGCATTGGACATACTTGCAAGTACATCGCCGCATGAAACACGCAAACGAGAATTACATATTTAAATGTAACCAGTGCGAGCTAACTTTCCCAAATAGTTGGAGTGTTGCGTATCACAGGAAGAAAGTTCACGGAAAATCCGGCCAAGACGAGAATGGGGGTATTACTAAAATTATAAGAGACAACAGGATCCCGTGTCGTGATTGCGACCAGACGCTGCCAAACAAAATTGAACTCTACAAGCATAGAAAGAAGGAGCACTGCGACGAAGCTTTGGATATGGACAAAGAAGGTAACTGGTCCGACCAAATAGACGACGCCAGCACCACCGTCTGCAGCAAGTGCGGGCACAACCTGCACAACGTGACGGCGCTGCAGAAACACGTCAAGGAGGTCCACGGTTACGCCAACCCCCACTCGTGTCCCGTGTGCGGGCGCAGTTTCCGTTCAGCTTCCATACGTAACGAGCACGTCAGGACTCACACGGGGGAACGCCCGTACCCCTGTGATGTGTGCGGGGTAGCCTTTAGACGATCAACGGCGATGCGTAATCATCGTCTCATTCACACGGGGGTCCGAGCGTGGGCTTGTCTGAGATGCCCAAAGAGATTCCGCATCAAATCAGACCTCAAGACCCACTTGAGACTGAAGCACCCGGCCCATTTGGCTGTTATTGAGGTCGAAGGTACCACGGCGTCCTCGGAGGACGTGCAGCAGCACCTGACCTTGAACAACATCAACCAGGACAAGGTCATTGAGATAACTATGATAACGTTCGCTAAGGATACAACAAACATAGTTCCGAACTCATCCCGGGCCCTCGGCCTGCTGAATGACGTGCCAAGGACACGAGTGGTGTGGGAGAGACCGGCGCAATACTACGATGTGTTCCAACCTCGTCGTGGCAGGGGTATCGCCAAGACCCCGCGGCGGCCGAAAATATTAAGAGGGGAAGATCTACCCCAGGACGAGAACTACCCCCTGGACACTGATAACACGGAGACGGAGACTGATAAGATCCCAGATAACATCTATCCTGTGACCTTGAACGCGGACGGAGAGTTCCAAAACCTATCAACGCTGAACGTGCAACTTCTGCTACAAGATGGCGCTCTGGTGAACGGCAACCAGATGGTGGAACTGCAGCTGAACGACGAAATGCTGCTGGAGTAG

Protein sequence:

>DPOGS212514-PA
MEPQSPLILCMENVEGQEVALKIEPSDTFQIFLDNSKLLLGFEVDINSITGNQPVSLSDNIYTFLLNAEKNVAGNDLDQILDQNPESDDLVYVLDDGTQIRASQIQFDNEDPLIDLTAEKIPFVKYNDDGDEDVAEVGTVKDVNIIESPVTRWSKDCSPKCSFANSLPFKLVCNNTSNFDAQFSKYLEPTKTYTTLNPANRNKSPRSLIRDNYKNYDERNDFFTREDILNMFKDSPVTSLPCDGQNYEKRRHVRKTDPSRSVHKSWNYKTTPDVDNMSNNQDCFICGKIVENNEKLYLFDKEDQMLHRCEQRKFQQQLKIICERCLNENFKPSRMKSPSQSLNNDEYLVIKNNQQYIFQKITTIDLKKLNVESNKNSEYVKVEIGPDGEIITKPIDNDVNDVMVVKDEKKESSSDVEIIEPEVEIDIDNIEEADEQVKEFLGKYQCDATDNKELKCRFCERVFTELDQVIEHGEEHRHDVEDETVFPCPLCDYGYANFKYLKAHLKAAHINNKDSNSEEHDDKNTPKSSPVAKRTRSAVKKHDNENDDKNEKNSEETKANNTQFTTEVKQECVESSDESIWIVQTGDDDEQLNNLLQVKDDERDMNDRKKHKCFNCRAHHIRQFHDTTNLLTPDGRYVCSECENAEFATETELFDHVHFQHDKQKRWQCPVKDCGKTFFLRATLTKHSRTHTDTRRYVCVTCGKRFLDKQTLDEHGVTHLQIKPFQCHICLKQLTRRSRLRMHLRAHEEELSPALVLVCAICNRAFRDHVDAQEHATKSSECIEEFTNELKEEKEASEQLSPTSGLVRHTVHVVAEYSHNKAIDREVNNAESEALLSPLDDVAKNIIRVVNIEKAFRCEYCEDVFYLEDALNNHRVIHKGVQNPFTCHICKVSFATYSSWSVAYHRKKVHGKSGQDENGGITKIIRDNRIPCRDCDQTLPNKIELYKHRKKEHCDEALDMDKEGNWSDQIDDASTTVCSKCGHNLHNVTALQKHVKEVHGYANPHSCPVCGRSFRSASIRNEHVRTHTGERPYPCDVCGVAFRRSTAMRNHRLIHTGVRAWACLRCPKRFRIKSDLKTHLRLKHPAHLAVIEVEGTTASSEDVQQHLTLNNINQDKVIEITMITFAKVAEYSHNKAIDREVNNAESEALLSPLDDVAKNIIRVVNIEKAFRCEYCEDVFYLEDALNNHRVIHKGVQNPFTCHICKMHNTQNTHGFYKRSTTEGNEGHTGPASAGILGYGGFPVVKHFLCEDCGRSYLHWTYLQVHRRMKHANENYIFKCNQCELTFPNSWSVAYHRKKVHGKSGQDENGGITKIIRDNRIPCRDCDQTLPNKIELYKHRKKEHCDEALDMDKEGNWSDQIDDASTTVCSKCGHNLHNVTALQKHVKEVHGYANPHSCPVCGRSFRSASIRNEHVRTHTGERPYPCDVCGVAFRRSTAMRNHRLIHTGVRAWACLRCPKRFRIKSDLKTHLRLKHPAHLAVIEVEGTTASSEDVQQHLTLNNINQDKVIEITMITFAKDTTNIVPNSSRALGLLNDVPRTRVVWERPAQYYDVFQPRRGRGIAKTPRRPKILRGEDLPQDENYPLDTDNTETETDKIPDNIYPVTLNADGEFQNLSTLNVQLLLQDGALVNGNQMVELQLNDEMLLE-