Monarch geneset OGS2.0

DPOGS200074
TranscriptDPOGS200074-TA1536 bp
ProteinDPOGS200074-PA511 aa
Genomic positionDPSCF300044 - 359239-365794
RNAseq coverage155x (Rank: top 53%)
Annotation
HeliconiusHMEL0043151e-12952.14% 
BombyxBGIBMGA004594-TA5e-12648.99% 
DrosophilaCG3281-PA6e-3235.37% 
EBI UniRef50UniRef50_D6WI121e-5841.69%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WI12_TRICA
NCBI RefSeqXP_002429661.11e-4840.00%krueppel c2h2-type zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2700045934e-5841.69%hypothetical protein TcasGA2_TC003957 [Tribolium castaneum]
NCBI nr blastxgi|2700045933e-6342.04%hypothetical protein TcasGA2_TC003957 [Tribolium castaneum]
Group
Gene OntologyGO:00036764.3e-12nucleic acid binding
GO:00056348.9e-10nucleus
GO:00082708.9e-10zinc ion binding
GO:00056225.4e-05intracellular
KEGG pathway 
InterPro domain[436-463] IPR0130874.3e-12Zinc finger, C2H2-type/integrase, DNA-binding
[4-73] IPR0129348.9e-10Zinc finger, AD-type
Orthology groupMCL20421 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200074-TA
ATGGACAATGTTTGTCGCTTATGCTGTTCTGCGAAATTCGTCAATAATTATATTTTTGATGAAGGAAATGCTTTGTATTTGAAGATGTCATTATATTTGCCAATAAAGGTTTTCAAAAACGACCGGCTTCCGCAAAAGATTTGTGACAAATGCAGTTGTAAAGTGAATGACTTTTATCAATTTTGCAACGAAACTATAGAAGTTCAAAACAGGTTAAGAGGTCTGCTTCGACAAACCATACTCGATGACAACACAGTCGATTTCAGTATAAAAGAAAACATACACTCACCGTCAAGGCTAGCTATCTGTGAGCGGTCGACACAGACAAATTTCAATGATACTTTCCAAGTGAAAACTGAACCTAGGTCAGAAACACCATTGATTGTGAAGGAGGAAAGCTGTGATGAGGATTTGAATAATGTTGCATCCGATGCCAGCGAGGATGAATTGCTAATAGAAATCAAGAAGAAGAAAAAGCCAAGAAAAGTCAACAGCAATGTCAATGAAAAGAAAGATAATAAAAGGAACAGGAAAAGAAAGGAAAGACAGGACTGGCCGGATGATGGGTTCGATAACGGCTTAGATTTGGGTGTTGTTAAGGAGGAAGTGGACTTAGTTAAACAAGAAGCATCGGATTACACATGCTGCATGTGTTTCGAAAAGTGCTCCAACAAGACGGAAATGTTGCAACATTATAAAAATCATGGTGATTCAGCAGCGGCCCCGCAGACCGCGCCACCGCCTCCACAAGGCGACGTGCAGCGCTGTCTGCGCTGTCAAAAGGTTGTGTCGGGGGGTCTGGCTGAGTGGTCGGAGCACTGGAAGCGTCATTACGCCCGAGACACGAGGCCTTACAGATGTGTGCTCTGTGCCAGAACCTTCAGGGACCACCACCTTATATTGAAACATGGGCTCACTCATCAGGCGGATATCGAGGATAAGTCGTTAGCCCCGGGGGGCGGAGTCCTACCGGACAAGCGTTTCGTCTGCGACGTCTGTCCTGAAGGCTTTCCATACCTCCGTTGTCTGTTGGCCCATCGGACTAGAGCCCATCCAGAAGCCCTAAACCGGGCTGCGAGACTCCGCTGTGGGGTCTGCGCCCGGGGCTTCGCCCACTGCAACTCACTTAGGAGACACCTCAGAGCACATTCCGGTGAAAGGAATTTCCTTTGCAATGTATGCGGCAAAGCTCTAACATCGAGGGAGCATCTCAAGTTCCATATACGGATTCACACCGGCTACAAGCCTAATGTATGCAAGGTCTGCGGTAAGGGCTTCGTGAAGAAGTGTAATTTGACATTACACGAGCGAGTGCATTCGGGTGAGAAGCCACACGTGTGCCCGCACTGTGGGAAGGCATTCTCCCAGAGGTCGACGCTAGTTATACATGAGAGATACCACAGCGGCGCTCGTCCGTACACTTGCGGTTTGTGCGGTCGGGGCTTTGTGGCTAAGGGTCTTCTGTCGATGCACCTGAAGAGCACTTGCGTAGATACAACGCAGGCTAGATCGCAACAAAAGTTGAATTCGCGATAG

Protein sequence:

>DPOGS200074-PA
MDNVCRLCCSAKFVNNYIFDEGNALYLKMSLYLPIKVFKNDRLPQKICDKCSCKVNDFYQFCNETIEVQNRLRGLLRQTILDDNTVDFSIKENIHSPSRLAICERSTQTNFNDTFQVKTEPRSETPLIVKEESCDEDLNNVASDASEDELLIEIKKKKKPRKVNSNVNEKKDNKRNRKRKERQDWPDDGFDNGLDLGVVKEEVDLVKQEASDYTCCMCFEKCSNKTEMLQHYKNHGDSAAAPQTAPPPPQGDVQRCLRCQKVVSGGLAEWSEHWKRHYARDTRPYRCVLCARTFRDHHLILKHGLTHQADIEDKSLAPGGGVLPDKRFVCDVCPEGFPYLRCLLAHRTRAHPEALNRAARLRCGVCARGFAHCNSLRRHLRAHSGERNFLCNVCGKALTSREHLKFHIRIHTGYKPNVCKVCGKGFVKKCNLTLHERVHSGEKPHVCPHCGKAFSQRSTLVIHERYHSGARPYTCGLCGRGFVAKGLLSMHLKSTCVDTTQARSQQKLNSR-