Monarch geneset OGS2.0

DPOGS200297
TranscriptDPOGS200297-TA1287 bp
ProteinDPOGS200297-PA428 aa
Genomic positionDPSCF300026 - 405271-407436
RNAseq coverage56x (Rank: top 69%)
Annotation
HeliconiusHMEL0057962e-12251.50% 
BombyxBGIBMGA005571-TA5e-3345.18% 
Drosophilacrol-PE3e-2435.43% 
EBI UniRef50UniRef50_D0ABA31e-11649.45%Putative Zn finger protein n=1 Tax=Heliconius melpomene RepID=D0ABA3_9NEOP
NCBI RefSeqXP_001121357.14e-2933.69%PREDICTED: similar to CG15269-PA [Apis mellifera]
NCBI nr blastpgi|2613359184e-11649.45%putative Zn finger protein [Heliconius melpomene]
NCBI nr blastxgi|2613359181e-12649.56%putative Zn finger protein [Heliconius melpomene]
Group
Gene OntologyGO:00056341.7e-10nucleus
GO:00082701.7e-10zinc ion binding
GO:00036768.3e-08nucleic acid binding
KEGG pathway 
InterPro domain[23-99] IPR0129341.7e-10Zinc finger, AD-type
[355-369] IPR0130878.3e-08Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL25150 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200297-TA
ATGAATACACTTGAATTAGAAAATATTAACAATAACTTAAACACGACTTCGGGTTCTCATATTGGCGTCTGTAGAACTTGCCTGTCGATAGCAACTGAAAAAAATATGAGTGATTTACCTGATGGTTTATGCGAGGATACTAAATCTTATCTCGATATTATGATGTTTTGCTTAAATTTACAGATCACACCAGATTCAAAAATCACAACAAAACTTTGTTTTAAATGTTATTCAAACATTATATCATATTATAGATTTAAGTCATTAGCATTAAAGAGCGATAAATACTTAAGATCTCTTGACCAAAATATAGATAGCAAAAATGGTGTTTTTGTTACAAAAGACATTAAATCTGAAGAACCTTATAATATCTCCAACGAACAGCAAAGTAATATGGATATGGAGTTAGATTTTGAAGTCAAAGAAGAAAATGACACGGAGGAGTTGCAGTCTGATGATGAACTGTTAAGTGTCATACAAAGAATAAAATATGAAAATGTTAAAGATGAAGAATCTAAAGAAAACGACGACAAGCCAATGAAATCAAAAAAATTGAAGAAGCTCAAGAAAAAACAGGAAACAACCCACCAAGTATGTGAAGAGTGTGGTAGAACAGTTCGTAACCTGAAGGAACACATGTACCTTCATCAGCCACTGCTTACTAGGAAGAGATACAAATGCAAAGTGTGCGAGAAAATGTTCTCGAGTTGCAGTGCCAGGTACAAACATTATAAAACAAAACACTTGGGCATCAAACAACACTGTAACGAATGTAATAAAGATGTAGTAAGTCTCTCAGCACACAGAATGGTAATTCACAATACAGAATCACTACCATACGAATGTGTATCCTGCGGTCGGAGGTTCATATCCCGGTCGTTGAGAGATCATCACATGTTGACACACACTAAAAATAGACCGCATCCCTGTGATCAGTGCGAGAAGACATTTAAAAGTAATTATACCTTGATGCAGCACAGAAGACAAGTTCATGATAAGGAAAAATCACACCTCTGCCAGTTTTGCTCGAAAAGGTTTTTTAAAAAATATCATCTACAGGTACATTTAAGAAGTCATTCAAAAGAAAAGCCATATGAGTGTCCGGACTGTGGTAAATTCTTCTCTATATATGATACTATACTGCTGTATTTGAAAAATCACATGATAAGCCACACAAAGGAGAAGAAATATGCCTGTAAGTACTGCGGAGTGCGCTTCGGGCGTTCGGACCATTGTTTGAGGCATCAGAGGACCGCTCATGAGAAACTCATTACAAATACAGCCTGA

Protein sequence:

>DPOGS200297-PA
MNTLELENINNNLNTTSGSHIGVCRTCLSIATEKNMSDLPDGLCEDTKSYLDIMMFCLNLQITPDSKITTKLCFKCYSNIISYYRFKSLALKSDKYLRSLDQNIDSKNGVFVTKDIKSEEPYNISNEQQSNMDMELDFEVKEENDTEELQSDDELLSVIQRIKYENVKDEESKENDDKPMKSKKLKKLKKKQETTHQVCEECGRTVRNLKEHMYLHQPLLTRKRYKCKVCEKMFSSCSARYKHYKTKHLGIKQHCNECNKDVVSLSAHRMVIHNTESLPYECVSCGRRFISRSLRDHHMLTHTKNRPHPCDQCEKTFKSNYTLMQHRRQVHDKEKSHLCQFCSKRFFKKYHLQVHLRSHSKEKPYECPDCGKFFSIYDTILLYLKNHMISHTKEKKYACKYCGVRFGRSDHCLRHQRTAHEKLITNTA-