Monarch geneset OGS2.0

DPOGS210396
TranscriptDPOGS210396-TA2208 bp
ProteinDPOGS210396-PA735 aa
Genomic positionDPSCF300291 + 14820-17438
RNAseq coverage341x (Rank: top 34%)
Annotation
HeliconiusHMEL0130040.060.73% 
BombyxBGIBMGA008282-TA0.063.40% 
Drosophilacrol-PE6e-2527.38% 
EBI UniRef50UniRef50_UPI00020624673e-2722.66%UPI0002062467 related cluster n=4 Tax=unknown RepID=UPI0002062467
NCBI RefSeqXP_001946669.12e-3127.85%PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp), partial [Acyrthosiphon pisum]
NCBI nr blastpgi|3287113632e-3027.41%PREDICTED: zinc finger protein 845-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3287068193e-3826.94%PREDICTED: zinc finger protein 91-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00036763e-07nucleic acid binding
KEGG pathway 
InterPro domain[566-602] IPR0130873e-07Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL25458 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210396-TA
ATGGAGGATGAGACTATCATTACGAACACTGATACATTGTACCCGGTACTGCTTATACCAAAAAAATGCACCACATTGGAGACGAATTGCGAGGCTGCGGACAATGTGTTCATAGATTTAAACATAACCTCCCCCGACACTGAAGGTAATGAAACCTGTATTTATTCGGGGTTCTGCAACGTTAAACTTATAGATAGCTATGACGTTAACATCAACGAGGGAGATTCCAACAGTCAATCCTCGGATTCCAAGGACGATCACCAAATTCCTGTGATAGTGAGCGAGGCCAGTCAGGATCCGACATTCAATCAGTCCCAAGTATGGATAGATCCGGCCAGGAGCCCGTATGTTTACAACAGCTACGACGCAGTGGAGTCCCAAGAACATTACTCAGTGTACAGCCAGACCCTCGCCAGCCACATCACAGTCGAGCAACACAATACGTACATAACACAGAATGTATACAACTACAAAGAACAAGTACCGTATTCGCCGATCGAGGAATTCGGTCACAAGAAATTCAGAAAGAATCATGGGAATGAGATAAATCATGATTTGTACGAGGATGACGAGTTTGAATGCGGCGTCCTGCTGTCTCAGTTGCCCGAGGAAATCATGAATGAGAAAGAGAATAAAGCGAGAGAGGAATTGATGTACATTAATCTTCAGAGTGAAAATGATGCTCTGAAGCAATTGATGTCTTCAGATATACAGTCTCTGTCGAAACAACAGAAGACCATAATGTACCTGGGTCATGATTCGCATTACGGCCAAACTGTTCAGAAAATTGCAGTCAACGATCCCTCCTTGGAGTGCAATGGTGTTGGTAATGAAGTCATTGTAGAAAGTACTCAACCTAATCATAAAAAGTATCAATGTGATAAATGCAAGCAATTTTTTGATCAAATAACAACTTTCAAACAGCACATGGTCGGTAACCATAAGAGTCCAAAGGAGTTATTCAGACCCAACCCCATTAGAGAAATGAAATTCATGTGTACCGAATGTGGCAAACATCTGAAGAATCAGAAGAAATTTGAACTGCACTGTCTCGGTCATGGTAACCCTGAGTTAGAGTGCAACAAATGCCACAAAGTGTTCGCTTCAAAATTCACTCTAAGAAATCATTTAAAAATACACCAGAGAAAATTTCCTTGTATGTATTGTACTAAAACTTACTCAAGTTCTGAGGAACTCAGGTCTCATAAGGCTAAGATACATTTTATTTTCATGTGTGATATTTGTGATTACACAGCAACAAAGTATCCGGATCTGGAGGAACACAAACAAACACACGCCAGTGAAGTAGATTTGAGTGATTCCGACTTCACTCTATCACTCAACGGTGACATCATCAGCGACAGCGGCAGTCTATCACCATCCACCGAAGATTACATTATAAGATCCGAAGACTTCGATCAGAAATTGGATGAAATTAAAAAAGCCGATTCGGTGATCGCTAAGGTGATGTCGAACAAAATGTTTCTGCTTCACACCAAGAAGGCTCGACGAGATAAAAAATATAAAAAAACTTGCGACGTGTGTTTAAAGACCTTCGATCGTATCGGCGACCTCAAAAGGCATCTCATAGAGCACGTAATTAGAAGCACCCTGGCAAAAACTCCAGTCAACAAAAACGGTACACTGACAATACAATGTGAGGTCTGTCAGGTCGAGAATTTCACTAAAATCGACAAATACAAAGCTCATTTACGCGAACACGCGAAACTCACCTTATATAAATGCACATTTTGTGACAAATCCTTCAGCGACTCCAGCAACTTTTCGAAGCACAAGAAAATTCACGGCACAACATTCTTCCAATGCGATCTCTGTCAAAGGAAATTCAACTCCAAGAAGATGATCGCACAACACATGGAATATCACAATAACAATTCTCCAATCCCTTGCCCGTACTGCGATAAAGAGTTTCATTTTCCCTCGATGTTGAATAAACACGTAAAATGTGTTCACAATCGAGAAATGTCCAGATTTAGGTGTAGATTTTGTCATGAATATTTTAAAACACTTAAAGATAAATGGGATCATGAATGGAGAATACATAACGTTAGGAAGATTATCGTGGACTGTTTGATTTGCGGTTCGAAGTTTCGAAAGTATTCAGAGTTAAAAAGGCATTGCACGGACGTTCACGGTTTAGATATACCTCCCGCCAAAAAGTTGCTGCGTAAACGGAGATTATTGTAG

Protein sequence:

>DPOGS210396-PA
MEDETIITNTDTLYPVLLIPKKCTTLETNCEAADNVFIDLNITSPDTEGNETCIYSGFCNVKLIDSYDVNINEGDSNSQSSDSKDDHQIPVIVSEASQDPTFNQSQVWIDPARSPYVYNSYDAVESQEHYSVYSQTLASHITVEQHNTYITQNVYNYKEQVPYSPIEEFGHKKFRKNHGNEINHDLYEDDEFECGVLLSQLPEEIMNEKENKAREELMYINLQSENDALKQLMSSDIQSLSKQQKTIMYLGHDSHYGQTVQKIAVNDPSLECNGVGNEVIVESTQPNHKKYQCDKCKQFFDQITTFKQHMVGNHKSPKELFRPNPIREMKFMCTECGKHLKNQKKFELHCLGHGNPELECNKCHKVFASKFTLRNHLKIHQRKFPCMYCTKTYSSSEELRSHKAKIHFIFMCDICDYTATKYPDLEEHKQTHASEVDLSDSDFTLSLNGDIISDSGSLSPSTEDYIIRSEDFDQKLDEIKKADSVIAKVMSNKMFLLHTKKARRDKKYKKTCDVCLKTFDRIGDLKRHLIEHVIRSTLAKTPVNKNGTLTIQCEVCQVENFTKIDKYKAHLREHAKLTLYKCTFCDKSFSDSSNFSKHKKIHGTTFFQCDLCQRKFNSKKMIAQHMEYHNNNSPIPCPYCDKEFHFPSMLNKHVKCVHNREMSRFRCRFCHEYFKTLKDKWDHEWRIHNVRKIIVDCLICGSKFRKYSELKRHCTDVHGLDIPPAKKLLRKRRLL-