Monarch geneset OGS2.0

DPOGS202711
Transcript: DPOGS202711-TA (3807 bp)
Protein: DPOGS202711-PA (1268 aa)
Genomic position: DPSCF300272 - 66289-70584
RNAseq coverage: 488x (Rank: top 25%)
Annotation
Heliconius      HMEL016567        0.0     86.36%
Bombyx          BGIBMGA004355-TA  0.0     74.69%
Drosophila      peb-PA            8e-67   56.49%
EBI UniRef50    UniRef50_D6WSG6   0.0     42.72%  Putative uncharacterized protein (Fragment) n=2 Tax=Tribolium castaneum RepID=D6WSG6_TRICA
NCBI RefSeq     XP_973372.2       0.0     43.13%  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp  gi|189238869      0.0     43.13%  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx  gi|189238869      0.0     42.30%  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene Ontology    GO:0003676             1.7e-10  nucleic acid binding
KEGG pathway
InterPro domain  [1133-1172] IPR013087  1.7e-10  Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group  MCL16797               Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202711-TA
ATGTGCGTCGGTTTGACTCGTTCCTTTGTTTTTACAGAAATCAAAACAGAAGTCATCGTCGAGGCCAAAGTTGAAAGTGAGAAAGAAATGAAGATAGAACAGGAATCGCGCGCCACGCGCACCACCAGCATAGACAGCGACAACAGGAGCGATAGAAGCGAAGAAGATGACTGCAGGAGAGCGCGGAAGAGGATGGCCTCATCGTCTCCGTCTCCCATACCCTTGGACAAAAGGACAGCTCGCTTCGAGTGCCGGAAATGCCTGCAGCGTTTCGACTCCATAAACGCTTTCGATCTTCACCGATTTACAGCTCACGGAGACGACAACAGATCCGGATTCGACGATTATATAGATTATTCCAGCAAAAAGTTCCAAGATATCATACACCCTCACATAGCAGAGGGGCTGCGGTGTGAGCACTGCTCCCGCGAGTTCCCCACGGTTCAAATCTTGGATATTCACAGGAAGACATGCGGCGGGTCGACAAGAATATCCAGCCCCGACCGGGATCGCAGAGAGTTCTTCGCTAAGCTGGATTTAAGGAACAGGTCCTTCGGTATACCCGGGACGTTAACCCCACCTATGGATCGCTTCACACCAAAGTTTGACGAAACCCACCTCGCTAATGGAATTAGACCTATCGATGCCGCAAGAGACTTGGCCGATATTCAATCAATACTGAATGTAACATCTGCTGGAAGTCTCCTAGAAAGACTTACGGGTAGTCGTGTAGCTTTAGAAAGTTCGGTCCTAACACCTCCGGACACGGTCGTTAAGGAACGGGAACAAGAGGAGACTCAGGATAATTTCGCTGCTGAGTTCAGGAGAATGAAGCTCCGTGGAGAATTCCCTTGTCGATTGTGTCCCGCAAAATTCCCAAATCTTAGAGCTTTAAAGGGCCACAATCGGGTTCATCTGAGTGGAACCGGCCCAGGGCCATATCAATGTAATATGTGCCCGCATGCATCCCTGGACAAAGCAGCTCTTGTAAGGCATATGCGTACTCATAACGGAGACAGACCATACGAGTGCGCTGTGTGCAACTACGCCTTCACAACAAAGGCTAACTGCGAACGACATCTAAGAAACCGTCATGCAAAGATTACGAGAGAAGACGTGAAGCGGTCTATCATTTATCATCCATCTGAAGACCCCAACAATGATGAGGTCAATTCAAAGCTCGCTCGTGAGGAGGTCAAAAGGTCCTTGGCCTTCCACACGCCGGAGGTTGATAGACGCAATGAATCATCTGGACGTAACACTCCCCTGACTCACTTTAATCCCAGTTTCATCACCGATCGACACCCTGTTACTTCTTTGACTAAGCCTTTACCTGAAACGCCTTCTTTAACCCCAAGAGAACCTGAGCAGCCCATGCCCAGAATAAAAGTCAAAGGGATCGGCCAATTGACCCAAATTCCTGAATTTAGGACTCCAGAGTTAACGTTTAAACCAAACGACATACAGTCCGAAGCGTACAGCGAAGAAGCACCAGTAGACCTCAGTACGAGCGACAACAACAACTGCGATGTTTTAGATCTCAGTAAGAAAAAACGCGATGTAGACGATGACGAAACAAAGACAACACCTCGCTCTGCCTTCGATAATTCGTCTGCATCCGCTGCTGCATCCGCCGCTGCATCTGCTTTCGAAAAGACAAGGCTTCTTCTAGCTCAGCAAAGACTTTTCGAAACCTCTCTACCGAAAATCGATCCAGCGTATTACGCAACCCAACTTTCTCAGCTCTACGCCAGTGCAGTACCAGGCCTTCCCGGTTTGCCGATACCGTCGTCATTCCCCATAAACCCATACTTTCTCCAAACTTCTTTTTTTCCCCACCCAACAGATTCGCAAGAAATAACTGAAATCAAGCAGCGTATTCACAAAGAAATATTTAGAGGTCTAAGTATGTCTGGTGGGAGATTAGTGGACGATACTGAAACTCAGCTCAAACAGGAGCCAGAGGAAGAGGAAGTTAAACCAATCCCAGTAGCCTCGAC
TCCGACTCCCCCTCCCCATGCTGAATCTCCCCGTCCGTTGAGTAATCCATTAGTAACACAAAACGATTCCGTTAAGATGGTAATAAAAAATGGCGTTCTTATGCCGAAACAAAAGCAAAGGCGATACCGGACTGAAAGACCTTTCTCCTGCTCTCAGTGTTCCGCTCGGTTCACACTGCGCTCAAATATGGAACGGCACGTCAAACAACAGCATCCTCAACACTGGAGCGTCAGACGTCCATCGCCAAGAGCACCCCCGCCGTATCCTACAACAGACACTCTAGCCGATCGTGTAAAATATGCCTTACTCGCTCGTCACTTGGAGAGACCCTTGCAGGCAGATCGCTCTCCTATTCGAAGAGATTCGGATGAAGTAGCAGACAACGAAGAGGACGAGGACGACACCTTGGTCATCGACGAGGAACCGGAAGACCGCAAGCCCGAGGAGCACACGGCCGCTCGGAGAGCTGCAGCTGAAATTCTCATGGCCTCGTGTCAGCAAGAAATGCATAAGGATTTTGATCTCAAAATAGCTGGAAATCTTATTAACAAACCGGTTGCCCCAATCACAACTGACAAGGTCGAAGCTAACGCAGATCTTCTACCAACTCAAGATGCCATACCAGTTGTTCCAACTCGCAGTGACGAAGAAGAAGATGAGGAAGGTCTCGTGGCTTCCACCAGTGAGGGAAACAATTCGGGAAGCGATGAAAATAAATCTGAATCCGACACTGGAACGCAGCCTCCAAAGAAGAAATCCGCTTACAGTCTAGCACCCAACCGCGTCTCCTGCCCGTACTGCCACCGCAAGTTCCCTTGGTCCTCATCCCTCCGACGCCATGTTCTGACACACACGGGACAGAAACCTTTCAAATGTCCTCATTGCCCCCTTCTATTTACAACGAAATCCAACTGCGACCGCCACTTACTCAGGAAGCATGGGGGCTCAGCCAAAGCAATTCTCGCCGAACCCATTCCGGACACCATAACACCTCAAAACAACGAAAGCAGATCTGTACCTGAGAGGCCTTTTAAATGTGCCTCCTGTCCCACCAGCACGTTCTCCTCTATGGAGACATTGAAGAAACACATGTCTTCTCGGCACGGGACGGCCGAGTCACAGCCGAATTCACCAAACCCTGAAGTGGAAGACGCTGATGGAAGCCTAGTTTTCAAGTGTCACCTCTGCGAAGCGTCGTTCGGGGACCGATCCGGAGCCCTAACACACTTGGCATCAGTCCATGCCGCGGAGTACGAACAGCTTGTGAGCAAGGGAGCTCTAGACGCCGTCAGCGATCGCAGTGAAAGCGCCGACGACGATGAGAAGGGGAAGTTCCCAGATCACGCCAACAGAAAGGTCGTGTGTGCTTTCTGTGTTCGTCGTTTTTGGTCTGCGGAGGACCTGCGTCGTCACATGCGGACTCATTCTGGAGAGAGACCCTTCGCCTGCGACCTGTGCCGGAGACGGTTCACGCTCAAACACAGCATGCTGAGACACAGGAAGAAACACAGGGAGGATAGTGACGATGAAGACTCGCCCAGGGACAATGGATACCGCTACCACGATGAAGAGGGTTCTGGTAACGAGGTCCCCAGCAACGTCAACAACAATAACTCCCCGCCGGCTGCGGACAAGAAACTCAAAATAGAAATGACATCGCGCAAATATTCCTCGGAGAACGAGAACGACGCGGAAAATGGCGGAGATCTCATCGGGAAACTACTCGGGATACCGGACAAGACCATCATCAACAAGCTGCTCTCATCAGCGGATGAAGCTGCCAAATTCCTTGGCGTGAACAAATGA

Protein sequence:

>DPOGS202711-PA
MCVGLTRSFVFTEIKTEVIVEAKVESEKEMKIEQESRATRTTSIDSDNRSDRSEEDDCRRARKRMASSSPSPIPLDKRTARFECRKCLQRFDSINAFDLHRFTAHGDDNRSGFDDYIDYSSKKFQDIIHPHIAEGLRCEHCSREFPTVQILDIHRKTCGGSTRISSPDRDRREFFAKLDLRNRSFGIPGTLTPPMDRFTPKFDETHLANGIRPIDAARDLADIQSILNVTSAGSLLERLTGSRVALESSVLTPPDTVVKEREQEETQDNFAAEFRRMKLRGEFPCRLCPAKFPNLRALKGHNRVHLSGTGPGPYQCNMCPHASLDKAALVRHMRTHNGDRPYECAVCNYAFTTKANCERHLRNRHAKITREDVKRSIIYHPSEDPNNDEVNSKLAREEVKRSLAFHTPEVDRRNESSGRNTPLTHFNPSFITDRHPVTSLTKPLPETPSLTPREPEQPMPRIKVKGIGQLTQIPEFRTPELTFKPNDIQSEAYSEEAPVDLSTSDNNNCDVLDLSKKKRDVDDDETKTTPRSAFDNSSASAAASAAASAFEKTRLLLAQQRLFETSLPKIDPAYYATQLSQLYASAVPGLPGLPIPSSFPINPYFLQTSFFPHPTDSQEITEIKQRIHKEIFRGLSMSGGRLVDDTETQLKQEPEEEEVKPIPVASTPTPPPHAESPRPLSNPLVTQNDSVKMVIKNGVLMPKQKQRRYRTERPFSCSQCSARFTLRSNMERHVKQQHPQHWSVRRPSPRAPPPYPTTDTLADRVKYALLARHLERPLQADRSPIRRDSDEVADNEEDEDDTLVIDEEPEDRKPEEHTAARRAAAEILMASCQQEMHKDFDLKIAGNLINKPVAPITTDKVEANADLLPTQDAIPVVPTRSDEEEDEEGLVASTSEGNNSGSDENKSESDTGTQPPKKKSAYSLAPNRVSCPYCHRKFPWSSSLRRHVLTHTGQKPFKCPHCPLLFTTKSNCDRHLLRKHGGSAKAILAEPIPDTITPQNNESRSVPERPFKCASCPTSTFSSMETLKKHMSSRHGTAESQPNSPNPEVEDADGSLVFKCHLCEASFGDRSGALTHLASVHAAEYEQLVSKGALDAVSDRSESADDDEKGKFPDHANRKVVCAFCVRRFWSAEDLRRHMRTHSGERPFACDLCRRRFTLKHSMLRHRKKHREDSDDEDSPRDNGYRYHDEEGSGNEVPSNVNNNNSPPAADKKLKIEMTSRKYSSENENDAENGGDLIGKLLGIPDKTIINKLLSSADEAAKFLGVNK-
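
The lengths reported in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. The function names are hypothetical, and it assumes that the genomic coordinates are 1-based and inclusive and that the transcript consists of the coding sequence plus one stop codon (the trailing "-" in the protein sequence).

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues (plus an optional stop codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 3807
protein_aa = 1268
genomic_start, genomic_end = 66289, 70584

# Does the transcript length match the protein length plus a stop codon?
cds_ok = expected_cds_length(protein_aa) == transcript_bp

# Genomic span versus transcript length; any surplus is presumably
# non-coding sequence within the gene model (an assumption, not stated in the record).
genomic_bp = span_length(genomic_start, genomic_end)
surplus_bp = genomic_bp - transcript_bp

print(cds_ok, genomic_bp, surplus_bp)
```

Under these assumptions the 3807 bp transcript corresponds exactly to 1268 codons plus a stop codon, and the genomic span is longer than the transcript.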