Monarch geneset OGS2.0

DPOGS202114
Transcript        DPOGS202114-TA    5082 bp
Protein           DPOGS202114-PA    1693 aa
Genomic position  DPSCF300150 + 37905-45747
RNAseq coverage   504x (Rank: top 25%)
Annotation
Heliconius       HMEL003966         0.0     80.41%
Bombyx           BGIBMGA006957-TA   0.0     65.53%
Drosophila       crol-PE            6e-22   22.41%
EBI UniRef50     UniRef50_F7ASP2    1e-41   24.34%   Zinc finger protein 850 (Fragment) n=22 Tax=Xenopus (Silurana) tropicalis RepID=F7ASP2_XENTR
NCBI RefSeq      XP_001945749.1     2e-46   22.51%   PREDICTED: similar to mCG7830 [Acyrthosiphon pisum]
NCBI nr blastp   gi|334326386       1e-44   22.65%   PREDICTED: zinc finger protein 850-like [Monodelphis domestica]
NCBI nr blastx   gi|334326386       1e-59   22.46%   PREDICTED: zinc finger protein 850-like [Monodelphis domestica]
Group
Gene Ontology     GO:0005634    4.8e-15   nucleus
                  GO:0008270    4.8e-15   zinc ion binding
                  GO:0005622    9.7e-06   intracellular
KEGG pathway
InterPro domain   [8-84]        IPR012934   4.8e-15   Zinc finger, AD-type
                  [1528-1548]   IPR007087   9.7e-06   Zinc finger, C2H2
Orthology group   MCL21911      Insect specific

Nucleotide sequence:

>DPOGS202114-TA
ATGAATGTAAATTATGACCGTGTTTGTAGACTGTGCTTGTCATCTACATGTGAATTACTACCGATTTTTCCTACCACCAGTTCGGATGACTCGGAACCTCCTGTCCTCGCTTCGAAGATTAAAGATTGCGTGTCAGTACAGATAAGGGAAAATGACGACCTGCCAACCAATGTCTGCAGGAAATGTATGGATAATGTCAATAACTGGCACATATTTAAGGCAGTATGTGAAAGAACACAAAACAGATTGCTTTCGTTTTTAAATAAGGAGAGTGACCAACTAGAAGAGGTGAAAATAAAAAATGAACCTTTATCTGATGAAGCCTATGATGATGGAGTAGTCATTGATGGTTCATATCCTGAAAGTGAAAATGCAGCCCCATCTAGTAAGGTCCAACCGGAGGGTCCACCTATCTTGGCTTCATTGGGGCTCACACCAAGAAGTGATAAGAAATATGTAGATCCAAGGATGGATTGGCAACGAGTCCATGCTATATTAGATATTGTGCAAGGGAGTGATGTCATTGATTCGCTCCAATCTAACGAAGAGTGTGATGTTCTGCATGTATCGGATCACGATTCAGATTCTGAGACAGATTTATATGATGATGATAACTTAAATGATTATATAAATTATAGGAATGGTTATAAATGTAAAAATGGTAATATCCTATCAAAATATCCCAAAGAATCGTCACTTGAAACAAACAATAAAATAAAGGATGTTATCAATTTGAGGAAAAATAATCTGCAAGCGCTTGAGTTTTGTCCCCGACCTATAAACGCTTGGCAAACACTGTTTACTGATGACCTATTAGAACTTATGGTTACATCTAGTAATCACAAAATATCTAGAAACCATGATGTAATGGAATCTACTAATGGAATGGATAGCGAAAAGGATGAGGACTATGAAGAGGAGGAAGAAATTGACCAACAGCAATTCACACACGTCCCAAACATGCCAGAAGTTTCCATTACGGTCATGAGACCAACCGGTGAAACTCTTCACGCTCGTCAGGGTATTCAAGAAATAGCGTCCAAGGATTGTCTAGTTTGTGGAAGGTCGTACAGATATTCTCACAATGCCAGAAGACATGAATTGAACGCGCACAGCTTTGACAGATATACAAATAAAATAACGAATAAGAAACAGCAATCACACATGCAACCAAAGTTCAGGCCGAATCCATTCAACCCCAAAGCCCGCCTCATGCCGAATCCCATAAGCCACAAAATGCAATTTCATTCAAAATCAATTCCGGGTAGAATGCCACAGAGGATTGTCGCACCGCCGAAGCCGATACCAATTAAAACCGCTAAAGCTTCGCAGAATAACTTGCCATATCCTCTTCGTATAAAAGCATTAAAAGATCTGCAGATCAAAAAGAAGGAACCACAAATTTTAAAGACACTATTGACTTCCAAACCGGAAGTTTTGGTGTCTGAGCCTGAAATAATTAATTCAGGTCCAGAAAGTCCTGAGACGTTAATTTCCGAACCGGAAATAGCGTCTTTCCAAGTGGAGACAATTTTATCAGAACCAGATGGATATGTCAATCAACAACAAGATGATGACGAAGGTGTTAACGATAATAACCAAGGCCAACACTATGACACCGTTGATATGGAATCTGATAATGAAATTGAAATTGCACGGCAAAATGAAAATGAGATTGATGTAGATGAGAATCCCATGAATGATCCCGAAGAAAACCAGAATGATGAAGAGAATAATATGGATGATGGGCAAGAGGACAGTGATAGAGGTATTGATGATAGTGATGACAGACAGGAGGATTTAGTAAATAACGTGGATGGTGAAGGTGGTGATGCCAAAGAAAGTCAGGATGATGAGGCGGAAAAAGATAATTACCAAAATATTAAGAATGAAGATAATGAAGATGACGAAATGCCAGCATTGAACATTGCACCCGTCGTCGAGATAAATGAAGACATGCAAACCAACTCTTACAACAGTGACGGTAACGAGGAAGAAGA
AGCTGACGAAACGGTTGATCCAAATGACACGGTTGATGGTGATGAAGCCGAAAAAGATTTGGACCCAGATAAGGTGTACATCACAAAAACTCAAAGAGACTTCATCTTGAAGTACCGTGATATCATCGAACAGATCAATACCAAGCGGTGTCTTTGTTGCAATAAAGAACATCCACGCAGGAAAGCGGTTATACAGCATTTGCAGAAAAACGGACACAAAGTCCCAAAGCACACTTGTTATAATTGCGTAGTGACCTATGGTCATATCGGTGCTTTGTTGAGTCACATGAGATCGAACACTTGTACTGATCTGTGGAAAATAATTTATAACGAAAATGGTATAACCGATGATATCGTGTTAGAAGATAAACCCGATAACAAAATTCCATACAAAGATGTTGTTAACGCTAGGTCCTATGCTTGTAAATTATGTCCAGCGAAATTCCAGCTGAAACAATTTATTATGAAACACGTTTTAGATGTCCACGAGGATGGCCAATCACGTGTTCCGTTTTCTTGCGTCCATTGTGGAATGAGGTTTAAAGACAAAAATATTGGTAAAAGACATATTCGCAATGGAGATTGCACTGTGTATATAGCTTGTGAGTTATGTTCTGACAAGTTTTTGAACATGCAAGATTTCAATGATCATGCTGTTTCCGTGCATGCTGGCAATTTAGATCCCGAAAACCAAAACAAATGTGTGGACGGTCGACCGACTGATTGTCCTATCTGTGGTAAAAAGAACAGTAGTTATCCCAATTTAGTGAAGCATTTGAAAGCTGTTCATAACGAAGAAAAGCCTCATCACTGCCAACATTGTGACTCTAAATTCGAGCAAACTACTGATCTTAACAAGCACATATACATGGAACATTCTGATAGAAGTTTGGGTATGCAGTCTATTGAGCCCGACATGTCCATTGTGAAGGAGGAAGCCGAAGAATATCATTATTCATGTACGGAATGCAACGCCATCTTTGAAACTGTCGACGCGTGGACGGATCATCAAGTTGCCGAACACAACCAGGTCGCACATCACTGCGACCAGTGTGAAAAGAAATTCTTACGACCATCAGAACTTGCTGAACATAAAAACACACATTTGCGAGTTAAATTTTATCCATGTAACGTATGTTCGAATTCCTACAGCACACCACAAAAGCTGACTGAGCATATGCAGCAGATGCATCCTGGGTCCAACGCACGTGGAGGTGATAGCGATTTCTATTGCGATATATGTGTTAGATCGTTCAAGAGCCGACAAGCCTATTCTAATCACATGCGTATACACTCTAGAGTTCCGACAACTAACAGGAAACCAGGAGAATCAAAAGGATTTGCTCCTCAAATTGTCGGTAAACCCATAAAGCAATTCTCAATGCAGCCTGGTTATAGCCCCTACGTCCCCAATGCACCATATTGTTGTGACATTTGCGGAAAAGGATTCATGCATAAGAAAAATATATGGAAACATAAAAAAGTGCTACACGCCGATCTACTTAATGATAGGAATGACAGCGAGGAAAATACCATGCATGCTTCTACAGAAGAGGATGAATATAATGTTGATGAAAACGGTGCCATTTTAACGACGCCACAATTTAATAGTTTTAATTTCACTAACGACGCTTTGCCATATTCATGTGAGTTATGCTATAAACGTTTTCCGCTGCGGACGTATTTGTGGAAACACAAACGCGCCAAGCACGGCATCACCAAACCTAACGCCGGTGAAGCTTCAGAAACGCAAACACAGCCGTCATCGGCTGAAGGGAGATCTAGTTGTACGATATGTAAAATAACATTCTCCGATAAGAAATCATACTATCGTCATAGGAAAAACGTTCACAAGTCGACTGTCCAGATGTGTAAGATATGTGGAAAGCCTTTGAATTCAACTTTGGAATTGTACGAGCATCTTAAAGCTGCGCATGCTCGCGAGTTACTTGGATACAACGCCAACCAAGGTCCGAGCAAATCGCAGGAGGTTGTTCAGG
AAATGGAGGTGGAATATGACGAGGATCAAGATTCCGCTGACCCGAGCGTCGATTACCAAGCCAGATACCCGTGTGATACTTGTGGGAAGCAGTTCGTTGGACTTCTTGCTCTGCAGAATCATCAGTGTATAAACCAGATTCAGACACAGCCACAAACATTCGAATGCGAAATTTGTCACAAGAGCTACACGTCAATTGCCGCACTGAAGAGCCATCGGGGATGGCATTTACGGTCTCCAGATGGAAAAGCGGCTGCCAATAACACTGGTTTATGGATGCCCCAAAACAAAGTGACGACCAAAGTCAGCAAACACGAAGTTGTAGATCCGTCTCAACTCGCGCGTGTTCAGCATTCAACACCAGCCAACATTGCTAAAAGGAGATTACCGCCGGAAGTCGAAATAACTGTGGTCAATCCGAACAAAAAGCTTCGGTCCGATGATTCTATTGAATTAGATCATCAGAACAATTCATCTGGCGGTCCCGAAGACAAATACTGTAACATTTGCGATAAAGAATTCACGAAGCGAGCGGCCTACCAGCGTCATATGGACGAAGTTCACCAACCGAACTCCGTGTTCTGTCCCGTATGCGACAAGAGCTTCACCAGGAAATCGACATTAATAGTTCACATGAAGAAGCACTACGAAAGCGGAGAAGGTACATCAGGGTCCACGCAAATGGATGAAGATTCGCACACGTGTGACGTATGCGGCAGTGTGTTCGACAGCTCGAAGTCTCTGATGGCCCACAAGAACATGCATCATGGAGAGGATGAATCCGACCAGTCTGAAGACGACGGCGGTGCGACTATACAGCCCCCAGGCGAGTTCACGTGCGCTCAGTGCGGCGACGGCGTCGCTACACCACGCGACTTAATAGCACATCGAGCTATGCACGCCACTCCGACGAAGTTCTTCTGTAATATTTGCAAGGTCTACTTTGCTAGAGCGTTGGACCTCTCCTCCCACACTCGAGCCAGACATTCTGACAACGAAAAAGTATTCTTCCCTTGCGCGATGTGCGACCGTTTCTATATGAACAAGAAGAGTTTGCAACGCCACATAGAAATGGCTCACTGA

Protein sequence:

>DPOGS202114-PA
MNVNYDRVCRLCLSSTCELLPIFPTTSSDDSEPPVLASKIKDCVSVQIRENDDLPTNVCRKCMDNVNNWHIFKAVCERTQNRLLSFLNKESDQLEEVKIKNEPLSDEAYDDGVVIDGSYPESENAAPSSKVQPEGPPILASLGLTPRSDKKYVDPRMDWQRVHAILDIVQGSDVIDSLQSNEECDVLHVSDHDSDSETDLYDDDNLNDYINYRNGYKCKNGNILSKYPKESSLETNNKIKDVINLRKNNLQALEFCPRPINAWQTLFTDDLLELMVTSSNHKISRNHDVMESTNGMDSEKDEDYEEEEEIDQQQFTHVPNMPEVSITVMRPTGETLHARQGIQEIASKDCLVCGRSYRYSHNARRHELNAHSFDRYTNKITNKKQQSHMQPKFRPNPFNPKARLMPNPISHKMQFHSKSIPGRMPQRIVAPPKPIPIKTAKASQNNLPYPLRIKALKDLQIKKKEPQILKTLLTSKPEVLVSEPEIINSGPESPETLISEPEIASFQVETILSEPDGYVNQQQDDDEGVNDNNQGQHYDTVDMESDNEIEIARQNENEIDVDENPMNDPEENQNDEENNMDDGQEDSDRGIDDSDDRQEDLVNNVDGEGGDAKESQDDEAEKDNYQNIKNEDNEDDEMPALNIAPVVEINEDMQTNSYNSDGNEEEEADETVDPNDTVDGDEAEKDLDPDKVYITKTQRDFILKYRDIIEQINTKRCLCCNKEHPRRKAVIQHLQKNGHKVPKHTCYNCVVTYGHIGALLSHMRSNTCTDLWKIIYNENGITDDIVLEDKPDNKIPYKDVVNARSYACKLCPAKFQLKQFIMKHVLDVHEDGQSRVPFSCVHCGMRFKDKNIGKRHIRNGDCTVYIACELCSDKFLNMQDFNDHAVSVHAGNLDPENQNKCVDGRPTDCPICGKKNSSYPNLVKHLKAVHNEEKPHHCQHCDSKFEQTTDLNKHIYMEHSDRSLGMQSIEPDMSIVKEEAEEYHYSCTECNAIFETVDAWTDHQVAEHNQVAHHCDQCEKKFLRPSELAEHKNTHLRVKFYPCNVCSNSYSTPQKLTEHMQQMHPGSNARGGDSDFYCDICVRSFKSRQAYSNHMRIHSRVPTTNRKPGESKGFAPQIVGKPIKQFSMQPGYSPYVPNAPYCCDICGKGFMHKKNIWKHKKVLHADLLNDRNDSEENTMHASTEEDEYNVDENGAILTTPQFNSFNFTNDALPYSCELCYKRFPLRTYLWKHKRAKHGITKPNAGEASETQTQPSSAEGRSSCTICKITFSDKKSYYRHRKNVHKSTVQMCKICGKPLNSTLELYEHLKAAHARELLGYNANQGPSKSQEVVQEMEVEYDEDQDSADPSVDYQARYPCDTCGKQFVGLLALQNHQCINQIQTQPQTFECEICHKSYTSIAALKSHRGWHLRSPDGKAAANNTGLWMPQNKVTTKVSKHEVVDPSQLARVQHSTPANIAKRRLPPEVEITVVNPNKKLRSDDSIELDHQNNSSGGPEDKYCNICDKEFTKRAAYQRHMDEVHQPNSVFCPVCDKSFTRKSTLIVHMKKHYESGEGTSGSTQMDEDSHTCDVCGSVFDSSKSLMAHKNMHHGEDESDQSEDDGGATIQPPGEFTCAQCGDGVATPRDLIAHRAMHATPTKFFCNICKVYFARALDLSSHTRARHSDNEKVFFPCAMCDRFYMNKKSLQRHIEMAH-