Monarch geneset OGS2.0

DPOGS211326
TranscriptDPOGS211326-TA1437 bp
ProteinDPOGS211326-PA478 aa
Genomic positionDPSCF300125 + 159012-166441
RNAseq coverage8x (Rank: top 86%)
Annotation
HeliconiusHMEL0024451e-11875.18% 
BombyxBGIBMGA000521-TA3e-3932.54% 
DrosophilaCG12299-PA1e-3530.96% 
EBI UniRef50UniRef50_E0VG311e-4536.43%Zinc finger protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0VG31_PEDHC
NCBI RefSeqXP_001943533.11e-4635.69%PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp) [Acyrthosiphon pisum]
NCBI nr blastpgi|3287068191e-4632.52%PREDICTED: zinc finger protein 91-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3287068197e-5232.83%PREDICTED: zinc finger protein 91-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00036761.1e-12nucleic acid binding
GO:00082703.7e-05zinc ion binding
GO:00056223.7e-05intracellular
KEGG pathway 
InterPro domain[352-381] IPR0130871.1e-12Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL34876 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211326-TA
ATGATGTTCGACCCACCTGATTTCCCGTATAAAGAATTAACAGACACCCTAGGAAAATCTTTTAATGTTGAGACTGTCTCGACTTTGAATGTAGCACAGACTAGAGACGAACAAAACTTAGAGCTTTATGTTGTGCAATCAGAAAATTTGTTCAACTACGAGCCTACGTTATTAGATGATAATACGGTCAATCAATTTCTGCTGCCGATTCAGGAATCTTCGGATACTGCTAATAAGGAGCCAGAAGCCGGCCCGGACCTCCTGGCTCTCCATTCGTGTGTTATTTGCCATGAGATCTTCACGACGGATGCTGAGTTACTGGACCACACAGTTCTCGTGCACGCAACCGTTCCTACATCAGATCAGGAACCGAGCACCTCAACAAGTGTAGAACATATTGATACACAAGAATCTAGCTTCGAGTGTGAATGGTTGGTGTGCGGTGTGTGCGAGCGTGTGTTGTCAACTGCGCTCGACTCGCAGCCCCGCGAACTATTATGCTGCACGGAAGCGAGAGATGGAAGCAATATTGGTTTACGAAAGTTGGCCACCATGTTTGTGTGCGATTACTGCAGCTACCTGTTCGCTAACGGCGAGGCTTTGGATAGACATCGCCTGGCGTGTTTCGCACATGAAGATTACTATCCTATACTAATGAATCCTATCTTGGAATCAGTTGCTCCTCATGTATCTGAACATCCGTGTGGTGTGTGTGGGAGACAATTTGTTAACGGCGAGGAGCTCAGGGCGCACACGATACAGTGCCAGCCTCGGAGACGTCGGCCTTACTCCGGGAACACCAACAAGATGTGGAACTGTGGATCCTGCAACCAACTATTCACGACCGCTAGGGAGCTGTATCGACACAAGCGCGGCGAGGATCGTCCGCCCGGCGCCCCCCTCACCGCGTACGTGTGCGAGGACTGTGACAAAGTACTGGGGAGCATGTGCGCGCTGCACACACATAAAAAGATGCACAAAGTTTCTAAGCCAGGCTATCCGTGTCGCGTGTGCGGGAGACGCTTTAACCAGAGCGGTCACCTCGCTATCCACATGCGCATGCATACGGGAGAAAGGCCTTACCCGTGCGACCTGTGCGACAAGGCCTTCAAAGTTAAGGTGGAGCGCGATGACCACCGGCGTACTCACACCGGCGAGCGACCGTTCGCCTGCACCTCTTGTAGTAAAACTTTCACCGCGGCCGCCAGGCTCAGGGAACACGCCAGGATACACACGGACCAGAGGCCGTACAAATGCGAAATTTGCGGGGCAGCCTTCCGGCGGCCGTACGCTCGCACAGTCCACACACTCATACACACGGGGGAAAAACCTCACGAGTGTGACGTTTGCGGCACAGCGTTCAGACGTTCGGGCGATATGTGGAAACACAAGAGGACTTTACACGGCATCCAGGGTGCAAGCGGTGATACCAATTGA

Protein sequence:

>DPOGS211326-PA
MMFDPPDFPYKELTDTLGKSFNVETVSTLNVAQTRDEQNLELYVVQSENLFNYEPTLLDDNTVNQFLLPIQESSDTANKEPEAGPDLLALHSCVICHEIFTTDAELLDHTVLVHATVPTSDQEPSTSTSVEHIDTQESSFECEWLVCGVCERVLSTALDSQPRELLCCTEARDGSNIGLRKLATMFVCDYCSYLFANGEALDRHRLACFAHEDYYPILMNPILESVAPHVSEHPCGVCGRQFVNGEELRAHTIQCQPRRRRPYSGNTNKMWNCGSCNQLFTTARELYRHKRGEDRPPGAPLTAYVCEDCDKVLGSMCALHTHKKMHKVSKPGYPCRVCGRRFNQSGHLAIHMRMHTGERPYPCDLCDKAFKVKVERDDHRRTHTGERPFACTSCSKTFTAAARLREHARIHTDQRPYKCEICGAAFRRPYARTVHTLIHTGEKPHECDVCGTAFRRSGDMWKHKRTLHGIQGASGDTN-