Monarch geneset OGS2.0

DPOGS215016
TranscriptDPOGS215016-TA1107 bp
ProteinDPOGS215016-PA368 aa
Genomic positionDPSCF300256 + 225833-229868
RNAseq coverage118x (Rank: top 58%)
Annotation
HeliconiusHMEL0101601e-12561.08% 
BombyxBGIBMGA012174-TA1e-9248.78% 
DrosophilaCG7372-PA1e-2437.57% 
EBI UniRef50UniRef50_D6X4Z73e-2427.56%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X4Z7_TRICA
NCBI RefSeqXP_001814647.16e-2527.56%PREDICTED: similar to zinc finger protein 617 [Tribolium castaneum]
NCBI nr blastpgi|1892417831e-2327.56%PREDICTED: similar to zinc finger protein 617 [Tribolium castaneum]
NCBI nr blastxgi|1892417834e-2827.79%PREDICTED: similar to zinc finger protein 617 [Tribolium castaneum]
Group
Gene OntologyGO:00056341.7e-12nucleus
GO:00082701.7e-12zinc ion binding
GO:00036764.1e-09nucleic acid binding
GO:00056229e-05intracellular
KEGG pathway 
InterPro domain[34-105] IPR0129341.7e-12Zinc finger, AD-type
[270-291] IPR0130874.1e-09Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL25931 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215016-TA
ATGTTGCCTATATTTCTTCTAATAAATCGTATGGCGTCTCGACGGACTTCTAGCAAAGCTAAGCCGATGACAACTTTTAAAAAAGAATCCGGCAACGGCTTCTGCAGGACCTGCTTGAGCACTAAGAATCTAGAACTTATATTTAGTAAAGCGGAATCTAAAGACAAGAAATCTGAGGAACTACGTCTTGTAACGGGTTTAGAAATTAAATTCAACGATGGATTATCACAAAAAATTTGTACAGTCTGCATCAAAACAATCAAAACGGCGCTAAGATTCAGAAGGATCAGCAAGAAGACTGAAAAAACTCTTCTCAGTATGACGGTGAGATCGAAACAACACGCAAATCCAGGTACAAGTCTCCCAAAAGCCAATGTCATAAAGACGGAACCAGGATCCGATGCAGTGGAGCTCAAGAACGATAGAGAAGACACATCATATGATGCCGACTGCCAGAACTATGATTATTATAATTACACTTACAAAGAGGAACCGGAGACGTCAGAGAGAGAGATGCAAAAACCGACTCGAGGGAGACAGAAGTGTCTCGGACCCAACGGGACGTACAAGTGTCATGTCTGCGGTAAGGAGTTCAGGATGAAGGCCACTTACACAGCACATCTGAGATTCCACACACACTACTGTGTTTGCGAGGCGTGTGGGAAGCGCTGCCGGAACAACAACCAACTGCAAGAACACAAGCGAGCTCGACACGGCCTGGGGAAGATTCATAAATGCGCCTACTGCGAGTACAGCTCCGCGACCAAGGAAGCTCTCATCATCCACGAGCGGCGTCACACGGGAGAGCGGCCTTACGTCTGCGATCACTGCGGCGCCTCCTTCTTCAGGCGGTCCACGCTAGTGCAGCACATCGCCATACACCTGCCAGAGAAGAACTTCCAGTGCGACATGTGCCCTAAAAGGCTGAAATCCAAGAAGTTCCTCCAAATCCACAAACACAACGCGCACACGGGCAAGAGGTACGGCTACCTGTGTTCCGTGTGCGACCACCGCTTCGAGAAGCCAAATAAGGTGCGCGCGCACACGAGGCGAGTCCACGGACTGCCGGACGAGCAACAGGGACCCATAGTACGGATCATACTATAG

Protein sequence:

>DPOGS215016-PA
MLPIFLLINRMASRRTSSKAKPMTTFKKESGNGFCRTCLSTKNLELIFSKAESKDKKSEELRLVTGLEIKFNDGLSQKICTVCIKTIKTALRFRRISKKTEKTLLSMTVRSKQHANPGTSLPKANVIKTEPGSDAVELKNDREDTSYDADCQNYDYYNYTYKEEPETSEREMQKPTRGRQKCLGPNGTYKCHVCGKEFRMKATYTAHLRFHTHYCVCEACGKRCRNNNQLQEHKRARHGLGKIHKCAYCEYSSATKEALIIHERRHTGERPYVCDHCGASFFRRSTLVQHIAIHLPEKNFQCDMCPKRLKSKKFLQIHKHNAHTGKRYGYLCSVCDHRFEKPNKVRAHTRRVHGLPDEQQGPIVRIIL-