Monarch geneset OGS2.0

DPOGS209176
TranscriptDPOGS209176-TA987 bp
ProteinDPOGS209176-PA328 aa
Genomic positionDPSCF300061 + 130765-131751
RNAseq coverage207x (Rank: top 46%)
Annotation
HeliconiusHMEL0097482e-17589.94% 
BombyxBGIBMGA011532-TA3e-6276.22% 
DrosophilaCG7372-PA6e-2531.43% 
EBI UniRef50UniRef50_D6WJ559e-2837.57%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WJ55_TRICA
NCBI RefSeqXP_001815479.12e-2837.57%PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp) [Tribolium castaneum]
NCBI nr blastpgi|1892378603e-2737.57%PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp) [Tribolium castaneum]
NCBI nr blastxgi|2700079785e-3137.79%hypothetical protein TcasGA2_TC014726 [Tribolium castaneum]
Group
Gene OntologyGO:00036761.2e-12nucleic acid binding
GO:00082702.1e-05zinc ion binding
GO:00056222.1e-05intracellular
KEGG pathwayxtr:4965586e-28 
 K10500 (ZBTB17, MIZ1)maps-> Cell cycle
InterPro domain[236-269] IPR0130871.2e-12Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL34793 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209176-TA
ATGCAGAACAACCCCCAGGAGCTTTCATGGCAGGCGATTCAGGCTATAGTGGGGGTTAAGCTCCCACCTAATGTGTTGCGTTTTGATGATATCGGGTTCCATTTAGTCAACAATTCGGTTTCAAGTGCCGAAGATTGGGATTCAGACGTAGAAATAGTGGAAAATGGCTGGGAATCAGACAATAACGTATCTTCTAAACCCGAAAACTCTGACAATGAAAAGACGTCTCCGGATAGAGAGCAGAGCGCGTCGCCTGTTTTAAGTCTCGTAGAACCCGACAATAACAATTCAGGTAAAGGCCTACTGCATTATACGGATTTTTTAGTGAAAATGGATACCAAAGACATTACTGATAATGAAATGACCGAAAATGATCCAGTGTCCCAGAGTAATGATGACGCAGATAGCTCAAATGGAAATGTAATGTCCATTGAAAACTATTTAGAGGTCACTCTGTCAACTGACTCGGAGGCTAGTCATGTATGTAGCCAATGCAACCAAATGTTTCCCAGTGAACAGCTTCTGTCTTCACATCAGTGTAGCGTGAAGCAAATCGAAGAAAAAAAGTATCCTTGTCATGTTTGTGCCGAGAAGTTCACAAGCTACTGGGAACTAAGAAAGCATATAAACAATCACTTCCCCGGAATGCTGGACTCCAAGTCGAGCTTCTGTCATTTGTGCCAAAAAGATTACACTAAAACAGGGTTTATGAATCATTTGAGAAAGCACACCGGTGAACGACCGTTTGTATGTGAGCTCTGCCACAAGGCCTTCTCCCAGTCGAGTTCTTTATCCATACATATGAAGTTCCATCTTAACGTCCGCAAACACGCGTGCACAGTTTGTGAGAAAAAGTTTGTGACCAAGAGTGAACTGTCCCGTCACATGACGGTGCACACAAAACAGAAGTCCTACTACTGTGGAGTGTGCGACAAGGCCTTCACTCGCTCCGACAACATGAAGAAACATGAAAAGACCCACGGATGA

Protein sequence:

>DPOGS209176-PA
MQNNPQELSWQAIQAIVGVKLPPNVLRFDDIGFHLVNNSVSSAEDWDSDVEIVENGWESDNNVSSKPENSDNEKTSPDREQSASPVLSLVEPDNNNSGKGLLHYTDFLVKMDTKDITDNEMTENDPVSQSNDDADSSNGNVMSIENYLEVTLSTDSEASHVCSQCNQMFPSEQLLSSHQCSVKQIEEKKYPCHVCAEKFTSYWELRKHINNHFPGMLDSKSSFCHLCQKDYTKTGFMNHLRKHTGERPFVCELCHKAFSQSSSLSIHMKFHLNVRKHACTVCEKKFVTKSELSRHMTVHTKQKSYYCGVCDKAFTRSDNMKKHEKTHG-