Monarch geneset OGS2.0

DPOGS203772
TranscriptDPOGS203772-TA1845 bp
ProteinDPOGS203772-PA614 aa
Genomic positionDPSCF300010 + 641747-649879
RNAseq coverage120x (Rank: top 58%)
Annotation
HeliconiusHMEL0042246e-17062.53% 
BombyxBGIBMGA007089-TA2e-2927.39% 
DrosophilaCG5245-PA1e-3128.24% 
EBI UniRef50UniRef50_UPI00020F6A301e-3430.16%UPI00020F6A30 related cluster n=1 Tax=unknown RepID=UPI00020F6A30
NCBI RefSeqXP_002031464.18e-3529.53%GM24032 [Drosophila sechellia]
NCBI nr blastpgi|3266765316e-3730.46%PREDICTED: zinc finger protein 850 [Danio rerio]
NCBI nr blastxgi|3266765312e-4429.81%PREDICTED: zinc finger protein 850 [Danio rerio]
Group
Gene OntologyGO:00036767.4e-06nucleic acid binding
KEGG pathway 
InterPro domain[344-378] IPR0130877.4e-06Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL34513 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203772-TA
ATGGCACAAATACAGATTAAAATAGAGCCTGAAGAAGAAGACGATTCTATGGATGTGGAAGTGAGTTTAAAAATGGAAAATGGAGATGAGAGCCAATCGGCTGTGAGCATTACTTCACAACATTTATTAGAAGAGAATGTAGTTGTGATTAAAGAGGAATTACGAGATAATATAGATGTAAAGATAGAACCTCTAGATATTAAAGATACTAATGAAGAAGATGGGGCTTTAGATGAGAAACAGGGAATATATTACGAAAGTGAACCCGAAGATCTCTCGGTACGGAAGCCCACCGGTTATAGCTCTGGAGATGAAGATAAATCACAAGGTTCTGATCTGGACTACCTCCTGCCACTCGCTGAGGACAAGACAAAGGAACCAAATGTGAAGCCTACGTTCAATGGAAAAATTAGAAAGGACAAGGAACAGAAATATTCAGACGAGATAACGAAACACATAGAGATAGTGACTATAGATGAACCGGCCCGTCAGCTGGAACACCGCGAACTGCTGGCGGGCCGGCTGCACATGAACTACACCTGTGAACCTTGCGCGCTCGGGTTCGTCGTGGAGGAAGCGTACGTCATGCACATGAAAATACACTCGCCGGAGAATGGTCCACATGAGTGCAGTATATGCAAGTCTCGCGTCAAATCCCTGGACGTGTTGTATCGTCACCGACTGCGTCACTACCGCCGCTACCGCTGCGCCATCTGCCGGCTTCAGTTGCGTGATAAAGACACGGTCGCCGCTCACGTCATGAGAGAACACTTGGGATCCGCTTTCCTTTGCACGCATTGCGGCAGAGGGTTCAAACGTCCACAATATCTGAAGCGTCACGTGGAACAGATGCACACTCGCCCGCTCCACCTGGAGTGCCCCGTGTGTCACAGGGTGTTCTACGAGCGAGGCTGGTACAGGTGTCACGTTAGAACCCACAACGAGCAAGTAAAGCAGCGAGCTGATCGTAAAGCGGTGTGTTCGCACTGCGGGCGCGAGTTCAGAAATAAGTCGTATTTGATACGACATCTTCAGACTCACGAGGATCGACGACAGGTGCGGTGTCCGCAGTGCGCGCGCTCATTCAAGAATAATGAGGTGTTGAGAGTTCATAGACGACAGCATCACACCGAGAACCCCTCCAGATACAGCCTCGACAGCGACGGCTTTAAGATTTACCCTTCAACTCTATCGGGACCAGCGAGTACAACCTGCGAGCAATGCGGCCGAGTGCTCACGACACGCGCGATGCTCACGAGACACGTTAACAGGATGCACACGGACAGGACCAAGAAGTTCCAATGTGATTACTGCAAGCGTCACTACTTCTCGAAAGCGGAGGTCCGTTCTCATATCGAGTGGACCCACCTCCAGCAGCGGCGACACGCGTGCACCTGCGGCCGGGTGTTCCGTACACCGGCTCGACTGAGGGCCCACGCGTGCGCCGTACACCTCAGGATACAGCAGCCGAGGGACAAGACGTGCCCCGTCTGCGGCAAGATGTTCGCGAACCAGCAGGTGTTGACGCGTCACATCCGGGGTCACTCCGGAGAGACCTACCCCTGTACGGAGTGCGGGCAGTCCTTCAAAACACAATCCTACGTAAAGATACACTACAAGATAAAACATCTAAATATGACGCGAGCGGAAATTAAGGCTCAGAGCAAAAGGAAACTGATCATGTTGGAGAACGTAGACGAGAGTATGAGCGCCAAGATAAAGAAGAAGAAGAGCCTAAAGAAGGATCCCTTGAATATAGAGGGGGCTGTCAGAATAAAGAAGGAGATAACAGAGCTCACGGTACCTCTATTTGAAACGTTCGTTGATATACAAAGGGAGTATTGA

Protein sequence:

>DPOGS203772-PA
MAQIQIKIEPEEEDDSMDVEVSLKMENGDESQSAVSITSQHLLEENVVVIKEELRDNIDVKIEPLDIKDTNEEDGALDEKQGIYYESEPEDLSVRKPTGYSSGDEDKSQGSDLDYLLPLAEDKTKEPNVKPTFNGKIRKDKEQKYSDEITKHIEIVTIDEPARQLEHRELLAGRLHMNYTCEPCALGFVVEEAYVMHMKIHSPENGPHECSICKSRVKSLDVLYRHRLRHYRRYRCAICRLQLRDKDTVAAHVMREHLGSAFLCTHCGRGFKRPQYLKRHVEQMHTRPLHLECPVCHRVFYERGWYRCHVRTHNEQVKQRADRKAVCSHCGREFRNKSYLIRHLQTHEDRRQVRCPQCARSFKNNEVLRVHRRQHHTENPSRYSLDSDGFKIYPSTLSGPASTTCEQCGRVLTTRAMLTRHVNRMHTDRTKKFQCDYCKRHYFSKAEVRSHIEWTHLQQRRHACTCGRVFRTPARLRAHACAVHLRIQQPRDKTCPVCGKMFANQQVLTRHIRGHSGETYPCTECGQSFKTQSYVKIHYKIKHLNMTRAEIKAQSKRKLIMLENVDESMSAKIKKKKSLKKDPLNIEGAVRIKKEITELTVPLFETFVDIQREY-