Monarch geneset OGS2.0

DPOGS208485
TranscriptDPOGS208485-TA1383 bp
ProteinDPOGS208485-PA460 aa
Genomic positionDPSCF300064 - 1108810-1113290
RNAseq coverage85x (Rank: top 63%)
Annotation
HeliconiusHMEL0083128e-3624.61% 
BombyxBGIBMGA010321-TA1e-4054.48% 
DrosophilaMeics-PA2e-3026.10% 
EBI UniRef50UniRef50_D2A2077e-4230.90%Putative uncharacterized protein GLEAN_08440 n=2 Tax=Tribolium castaneum RepID=D2A207_TRICA
NCBI RefSeqXP_001815603.12e-3827.97%PREDICTED: similar to Zinc finger protein 26 (Zfp-26) (Protein mKR3) [Tribolium castaneum]
NCBI nr blastpgi|2700062682e-4130.90%hypothetical protein TcasGA2_TC008440 [Tribolium castaneum]
NCBI nr blastxgi|2700062682e-4930.59%hypothetical protein TcasGA2_TC008440 [Tribolium castaneum]
Group
Gene OntologyGO:00036763.2e-12nucleic acid binding
GO:00056341.2e-09nucleus
GO:00082701.2e-09zinc ion binding
KEGG pathway 
InterPro domain[416-446] IPR0130873.2e-12Zinc finger, C2H2-type/integrase, DNA-binding
[4-69] IPR0129341.2e-09Zinc finger, AD-type
Orthology groupMCL30742 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208485-TA
ATGTTAAAATGTCGAGCGTGTCTTAAAGATGAAGAATATTTGGTGCCATTTAATGACCAAGATATTGATTACTTTAATTTACTTACTGATATAAATGTTAAAATGTTTGATAAACTACCACAACATTTTTGTATATGTTGTTTAGACAAACTTAAGTCCTTTATTGAATTTCGAGAAAAATGTTTATTGTCTAATTATATTTTAACAGAATCCATCAAAGAAGACAATATCAAGGTAGAAATAGAGAATACTCTCTTTGTGCAAAAAAGTGAAGTAAAAGAGGATTGTATTGATGAGATATTAGAAGGAGGTTTGATTGATAATGAAGAAACAGATTATGATTTATTCAATAATGTTAATATTAAAACAGAAGACAGTGAAGATATGAGTGGAAAGCTAACAAGACCTTCTAACATAACAAGAAAGAAAACACTTCTGTCTTGTGGACTCTGCATCAAAACTTTTAGAGAAGCCGATCTACTAACACTCCATCTGGCATGTCATAAAATTAGTAATTCGTGCAAAATATGTTCCCAAAATTTCACTGAGTGGCCGGCATTATACGCTCACCGATTGGAACACCTGACAAACAAACAAAAACATTGCCACATATGTTTGAAAAAATGTTACAAACCACAATATCTAGAATACCATTACAGAAAGTTTCACACAGAAGACAAAGATTACACTGTAAAATGTTCTGAATGTAGTCACACCTTTACAACACCGAAGAGATTACAGAAGCACATGTGGGCCAGCCATTCCAACAGGAAGTTCTACTGTGATCATTGTCCTAAAATATTCAAAAATAAAAGTGCCATTAAATCGCATATGCCAATTCATATGGACAATAAAAATGTGAAGTGCGGTTTGTGCGACTACACCTGCAAGTATCTCAGTAATTTACAAATTCACAAGCTAAGACGACATACACCACAAAAAGCATATTGCAAGAAATGCACTCGCGTTTTCTGTGATAAAGAAAAGTTGGACGGTCACAAATGCTCAGAAAAAGGTACCATATGTCCGGTGTGCGGTAAAATTGTGAAAACGCACAGATTGTTACGACGCCATATGGAAAGTCACGATAGCGCGGGAAAGTACAAATGCGAGCGGTGTCCAAAAGTGTACAAGTCGAAGAGCTCACTAGCGACACACAAGCTGATACACGACGGGGTGCGAACTAAACAGTGTGAATATTGCAACGCAAAGTTCTTCTCGGGATCTGTTCTTATAAAACACAGAAGGATACATACCGGTGAAAAGCCGTACGTGTGTCGGGTTTGTTGTCGAGGTTTCACTAGCAACCACAACTTGAAGGTGCACATGAGAGTCCACGGCGAATACTTGATAGAAAGAAAAAAAACTGATGATGGTGCTTGA

Protein sequence:

>DPOGS208485-PA
MLKCRACLKDEEYLVPFNDQDIDYFNLLTDINVKMFDKLPQHFCICCLDKLKSFIEFREKCLLSNYILTESIKEDNIKVEIENTLFVQKSEVKEDCIDEILEGGLIDNEETDYDLFNNVNIKTEDSEDMSGKLTRPSNITRKKTLLSCGLCIKTFREADLLTLHLACHKISNSCKICSQNFTEWPALYAHRLEHLTNKQKHCHICLKKCYKPQYLEYHYRKFHTEDKDYTVKCSECSHTFTTPKRLQKHMWASHSNRKFYCDHCPKIFKNKSAIKSHMPIHMDNKNVKCGLCDYTCKYLSNLQIHKLRRHTPQKAYCKKCTRVFCDKEKLDGHKCSEKGTICPVCGKIVKTHRLLRRHMESHDSAGKYKCERCPKVYKSKSSLATHKLIHDGVRTKQCEYCNAKFFSGSVLIKHRRIHTGEKPYVCRVCCRGFTSNHNLKVHMRVHGEYLIERKKTDDGA-