Monarch geneset OGS2.0

DPOGS205470
TranscriptDPOGS205470-TA1233 bp
ProteinDPOGS205470-PA410 aa
Genomic positionDPSCF300166 - 23938-25766
RNAseq coverage66x (Rank: top 67%)
Annotation
HeliconiusHMEL0175253e-10949.19% 
BombyxBGIBMGA003260-TA5e-4736.30% 
DrosophilaCG6654-PA3e-4138.71% 
EBI UniRef50UniRef50_D6WI594e-5233.09%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WI59_TRICA
NCBI RefSeqXP_001813550.18e-5243.97%PREDICTED: similar to novel KRAB box and zinc finger, C2H2 type domain containing protein [Tribolium castaneum]
NCBI nr blastpgi|2700042672e-5133.09%hypothetical protein TcasGA2_TC003594 [Tribolium castaneum]
NCBI nr blastxgi|2700042671e-6233.09%hypothetical protein TcasGA2_TC003594 [Tribolium castaneum]
Group
Gene OntologyGO:00056345.8e-15nucleus
GO:00082705.8e-15zinc ion binding
GO:00036766.8e-15nucleic acid binding
GO:00056223.5e-05intracellular
KEGG pathway 
InterPro domain[13-85] IPR0129345.8e-15Zinc finger, AD-type
[366-401] IPR0130876.8e-15Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL34602 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205470-TA
ATGGAGCGAGACGATGATCTCAAAAGTTATAATAATTGTTGTCGAACTTGTATGAAAGAGAGATCTAATATGTTCGGTATATTTATGGAAATAAATTCTACCAGTGCTGCACATATTTTGGCTACTTGCACTAATATAAATATCACTAAAGACGATTTGTTACCGAAGCAGATATGTTTCCATTGCTACAATTGTCTTGTAAGTTTTTATAAGTTTAGAAAACTAGCAGAAAGTATTGATGAGAAACTTCAGAGTGCTTTGTATAACAGATTATACAGCAGTTCAAGTGAGGATAATAAATTAGGTGAAATAAAAATAGAGCATGTTAATTACATTGATGGCGATATTCAACCAAAGGTAGAGGATAGAAGTGAGACTGAATCAAATTATAATAATAATACTAATAAAGACATTAAATGTAATATAAAGGCAAATGAAACAACAACACAGTCAAGTGAAGTCAATAAAAGATTATTACCGACGGTATCAAAGACTGAAAGCAAGGATGCAGCACATGATACAGATAATGTTAAATTCAAGTGTGAGATCTGTAGTAGAACATTCAAATCAATTAAATCCCTCTCCGCACATATGATCAAGCATACTAAGAAAGGCAGAATATTATCATGTAGTATATGCGGCAAGGAATTCAAAAAAGTCAGTCATGTTAAAAGACATGAAAAAATACATGAAATCAATCGGCCACACAAATGTGCTGTCTGTTCTAAATCATTTCCTAGCGAGGACATATTGAAAGAGCATTTAAACAAACACAATGGTGTAAAACCACATACATGCACATATTGTTCAAAGTCGTTTGCACATTTATTTACTCTGAAAGCACATATAAGGGTGCACACAATCGACAAGGCCTTCTTGTGTCCGACATGCGGAAAAAGCTTCTATTCGAGCACAAATTTTAAACAGCATATGAAAAGGCATGCTGGCTTGAAGACGTTTGCATGTGCAATGTGTCCAAAGATATTTATAAGTAAAGGTGAATTAAAATCCCATACCATAACACATACAGGTGAGAGGAATTGTACGTGTGATCAGTGTGGGTCATTCACAACCAAGGATCATTTGAAACGGCACTATAGAAGTCACACGGGTGAAAAGCCTTATAAATGTGATCTGTGCGAAAGAGCTTTCTCACAAAGCAACGACCTCGTCAAACATCGTCGTGTGCATTTGGGAGATAAAACTTATAAGTATGCTATAGATAATTTATAA

Protein sequence:

>DPOGS205470-PA
MERDDDLKSYNNCCRTCMKERSNMFGIFMEINSTSAAHILATCTNINITKDDLLPKQICFHCYNCLVSFYKFRKLAESIDEKLQSALYNRLYSSSSEDNKLGEIKIEHVNYIDGDIQPKVEDRSETESNYNNNTNKDIKCNIKANETTTQSSEVNKRLLPTVSKTESKDAAHDTDNVKFKCEICSRTFKSIKSLSAHMIKHTKKGRILSCSICGKEFKKVSHVKRHEKIHEINRPHKCAVCSKSFPSEDILKEHLNKHNGVKPHTCTYCSKSFAHLFTLKAHIRVHTIDKAFLCPTCGKSFYSSTNFKQHMKRHAGLKTFACAMCPKIFISKGELKSHTITHTGERNCTCDQCGSFTTKDHLKRHYRSHTGEKPYKCDLCERAFSQSNDLVKHRRVHLGDKTYKYAIDNL-