Monarch geneset OGS2.0

DPOGS210630
TranscriptDPOGS210630-TA2142 bp
ProteinDPOGS210630-PA713 aa
Genomic positionDPSCF300168 + 557983-560719
RNAseq coverage311x (Rank: top 36%)
Annotation
HeliconiusHMEL0071654e-10169.46% 
BombyxBGIBMGA013636-TA3e-10939.60% 
Drosophilacrol-PE3e-3927.16% 
EBI UniRef50UniRef50_D6W8M54e-4635.87%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W8M5_TRICA
NCBI RefSeqXP_001946669.11e-4027.12%PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp), partial [Acyrthosiphon pisum]
NCBI nr blastpgi|3017768134e-4529.67%PREDICTED: zinc finger protein 425-like [Ailuropoda melanoleuca]
NCBI nr blastxgi|3266765312e-5627.20%PREDICTED: zinc finger protein 850 [Danio rerio]
Group
Gene OntologyGO:00036764.3e-10nucleic acid binding
KEGG pathway 
InterPro domain[460-491] IPR0130874.3e-10Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL26102 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210630-TA
ATGAGCCCAAAACCTACTACAATCGATTTAGTCGGCGGAATCTACTGTAACATATGTAATTTGATATATGCAAATAAGAAGGATTACGACCTACATTACACTAAACACGAAACTGGCAGCAAGGACATAGTTTACACTTGCGTAGTGTGTAGAAAAGAAATCTCAGGCTATCCAAGCTTTCGTGGTCATTGTTACACAAGTCATGTCATCAAAGAGAGATTTAAATGTGAACACTGTTCAAAATTATTTTCAAAGTTTGCTTCACTACGAGAACATGTCATGGTAATGCATAGATTCAGATGCAACACTTGCAAGAAGGAATTTACATCAAAAAAAGAATTGAAATTGCATGAAATTATTCACAATGACAACGACAGTCCTCCTTATGAATGTAAAGCGTGCGGTGAGGAGTTAGACACTCTGGAAGACTGCAAGAATCATATAGACGTACACTCAGCATTCATATACTTTTGCCCCATATGTAATGAAAATATATCAAATAAAGAAAACTCCAGTGAGCATTTGAAGAAACATTTTGGACATGTGATTAATACAAATACTATTGAAACACAAAAGAGTCTGGACAAAGAGGAAAATTCAGTTGAGAGATTGGGCGGGATTTCGTGTAGTTATTGCTCCCAGACTTACAAGAACCGTATGGAATTTGATGCCCACTTCTCATGCGACCACGGAGACAAAGATATCATATACAGCTGTATTGTGTGTGCAAAGCAGTTTGAGAAGTATTCCGTATTTAGTCATCATGCTTACAATCACTTCACCAAAGGAAGATTTTGTTGTGACATATGCCGGAAGACATTCAACCGTCTCTCCCTGCTGGTGACTCACACGGCCGCCTGTCAGACCGATGCCGAATGTAAGGGGAAGCCGTTCACTTGCTATCAGTGTGGACACCGCTATGTGACGGAGATGAGGCTGAGGGAACATCTGAGGGATATACACGGTGTACACTGTGTCATCTGTCCGGAGGAAGGCTGTCAGGAAGTATTTGCCACACCAAAAGAATTGGTATTCCACCAACGTGCGCACCAATCCGACCGGAACTGGTGTCGCCAGTGTGGCCTGTTGTTCACCGGCCTCGCCTCCTGCGAGCGACATCTCGACGTCCACAAGAAGAAGCTGTATGTGTGTCCGGTCTGCAACAAGAACTACAGCGAGAAGCATCTCATACTGAAACATATCTCACAACATTTTGAAACTGTTTTGCACATTTGCAAAGTGTGCGGGAAGGTCTACAACGCCAAGAATCGTCTGATCGAACACTTCAAGTCGCACTCCGAGAACAAAACCCACAGTTGCACCTACTGCGACAAGAGTTTCGTGAAAATTGGCCAACTGCAGCAACATCTGAACATACACACGGGCTCCAAGCCATACAAGTGTCCGGTCTGCTCGAAAACGTTCGCCAGCTATCCCAACTGGCATAAACACTTGCGTCGAATGCACAACGGCGACGGAAAAAATTACAAGAAACCAGATATCGACAACGAAGAAGAGAACATCCACGATGAGAACGTGGAAGAATATCCTGGCGCCGATAGAAGTGTACAGAAGACAGACGCGTCTCGAGAAGACAAGCACACGCACCTCAAGGTAGACGCGGAACCGGCTGGTAATAAGGAATCCAGACTGGACACATACATCTATTACGAACCGAACGACAGCACCATGGAGTCGGACAGCATCGATCACGCCGTCATCGAGAAGGAGTTGGAAATATTCGAGAACACAAACGACGAGAGTATCGGCAACATCACGAAGTTTGTCAACGTCTTGGCCACGAACAATACGGTGTGCGTTGGCGAGAGTTCCGAGTCGTCGGCAGCCAGCGCCTTCCCCGCCGAGTACGGGCCAGAGTTCAGCGGGGTCATCGACCTGGACGACCACATGTTGCCTCACATCGATCCGCTGCTCATCAACAGCCAGCCGCCCCCGCCCTACGACCAGTTGGCCGACTCGTTGGTCGACTCACTGGCCGACTACACCGACAACTTCGCCGAAGCCTACGCCCCACCCAAATGGGAACCCATCATCACCAAGGTGTACCAGGACTACTCCTACGGTTACGGAGAAATGGTCGAGTCCAACCGACTGTCCATAATGAACACAGATATATTTTGA

Protein sequence:

>DPOGS210630-PA
MSPKPTTIDLVGGIYCNICNLIYANKKDYDLHYTKHETGSKDIVYTCVVCRKEISGYPSFRGHCYTSHVIKERFKCEHCSKLFSKFASLREHVMVMHRFRCNTCKKEFTSKKELKLHEIIHNDNDSPPYECKACGEELDTLEDCKNHIDVHSAFIYFCPICNENISNKENSSEHLKKHFGHVINTNTIETQKSLDKEENSVERLGGISCSYCSQTYKNRMEFDAHFSCDHGDKDIIYSCIVCAKQFEKYSVFSHHAYNHFTKGRFCCDICRKTFNRLSLLVTHTAACQTDAECKGKPFTCYQCGHRYVTEMRLREHLRDIHGVHCVICPEEGCQEVFATPKELVFHQRAHQSDRNWCRQCGLLFTGLASCERHLDVHKKKLYVCPVCNKNYSEKHLILKHISQHFETVLHICKVCGKVYNAKNRLIEHFKSHSENKTHSCTYCDKSFVKIGQLQQHLNIHTGSKPYKCPVCSKTFASYPNWHKHLRRMHNGDGKNYKKPDIDNEEENIHDENVEEYPGADRSVQKTDASREDKHTHLKVDAEPAGNKESRLDTYIYYEPNDSTMESDSIDHAVIEKELEIFENTNDESIGNITKFVNVLATNNTVCVGESSESSAASAFPAEYGPEFSGVIDLDDHMLPHIDPLLINSQPPPPYDQLADSLVDSLADYTDNFAEAYAPPKWEPIITKVYQDYSYGYGEMVESNRLSIMNTDIF-