Monarch geneset OGS2.0

DPOGS203814
TranscriptDPOGS203814-TA1752 bp
ProteinDPOGS203814-PA583 aa
Genomic positionDPSCF300010 + 2112593-2119072
RNAseq coverage300x (Rank: top 37%)
Annotation
HeliconiusHMEL0133240.058.33% 
BombyxBGIBMGA003721-TA2e-13855.19% 
DrosophilaCG10366-PA3e-4036.02% 
EBI UniRef50UniRef50_E0W4E31e-5138.19%Zinc finger protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0W4E3_PEDHC
NCBI RefSeqXP_002433237.12e-5238.19%zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420256504e-5138.19%zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420256504e-5933.73%zinc finger protein, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00056341.4e-11nucleus
GO:00082701.4e-11zinc ion binding
GO:00036761.6e-10nucleic acid binding
KEGG pathway 
InterPro domain[8-79] IPR0129341.4e-11Zinc finger, AD-type
[388-418] IPR0130871.6e-10Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL22040 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203814-TA
ATGGCTTTTAATATTAAAGACTTGTGTCGATTATGTGCCAGGAAAAACGAGTTTTCCAAAGATTTGTTAGATACCGCCAATCATAATGTTCTTAAAATGGTTCTAGGATACATAGAAATATCAATTAATAATAATGATGATTTACCATCTAAAGTTTGTATAAACTGTGAAGAAAAAGTTACATCTTTCCAACTTTTTGTTGTGGAGTGTTTGAAAGTACAGGAAACTTTAAAACGAATGGTTTTAGATAATGTATGTAATGAACCTCTAATAAAACTTGAACAAAGGATTGACAATTACTCATACAACTTACCAGAGATTAAATCCGAAGTTAAAGATGAATTACAAACTGACGAATTGACAAGTGAACTAAACAGTAATGAATTATTAGTTGAAATGAATTGTGATGAATTAAATAATGATAAGTATGATAGTGATGTTGGCAGTGATTTGAGTGATGTAGCACTGGCAACGCTGAAGGAGGCTAAGGACGAAAAGAAAAATAAAATATTAACAGAGGAAGAGGCCAACATATTCAAGAACGTTTTAAAAAAAAGACACCTCACTATAAAAGATTTTGTCAAATTAAAATGTGAAGAATGTGAGAATTGTATGAAGACTTGGACTGCTTTGTGGTGTCATTATTATACTATACATAAAACCAGGCCATACTTGTATTGTATATGTGGATTTGTTATAACATCGAAGATAGCTTTATACAGACATGTGTCCGACCATAAGGAGGAAGCGAGGAAGTATAGGAAAATGGATATAGGAACAGAAAAACAGAATGAGAAGTACTCCAATTATAATGTCAATGATTTTGTTAATTGTGATAAATGTCCAAGAATATCAAAGTCTGAAGCTGCGATGGCCAAACATAAATTAAGACACATACCAAAGTCAGAGAGGAAGTTTAATTGTAATTCTTGTGATAAATTCTTTAATTCAAAGGAACTTTTGAAATCCCATGCGAAATCTCACATACCGATAGAAGAAAGAAAAATATACCGTTGTGATATATGTTTCTTAAAGTTCACAACCCGGTCTTCGTGTGCGTCTCACAAGCGTATAGTCCACGACAAGATCAAGAGCTACGTGTGTGACCTGTGTGGGTACGCGTGTGGTACCGGCGGGGAACTGAGACAACACACGGCCATACACAGCGAGGATAAACCCTTCACATGCATTAAGTGTTACAAGTCGTTCAAGACGTACTCTAATCTGAAGACTCACATGGACACCCACGAAGACACCTCGTACGCGTGTCACGTTTGTAACAGAGTACTGAACAGTCGCAGGACGTTGAGGAAACATCTTCTGGTGCATGAGGACAAATGTCAACATGTCTGCTCCTACTGCAACAAAGCCTTCAAGAGACGGCAGACCTTGAAGGTGCACATGCACACACACACCGGGGACAAGCCGCTCAGCTGTAAGTGGTGCGACGAACGTTTCTCATACGCCTCCACGCTTCGCTCGCACCGTTTAAGATGTCACCCGGACAAGATGGCGGCCAGGTATCCGCCATACCTTACACTTGTGCACATGCACACACACACCGGGGATAAGCCGCTCAGCTGTAAGTGGTGCGACGAACGTTTCTCATACGCCTCCACGCTTCGCTCGCACCGTTTAAGATGTCACCCGGACAAGATGGCGGCCAGGTATCCGCCATACCTTACACAAGAGGGATACCTTAAATCTGATGCAGCTCCCGTCATGAAGGGCGACTTGGAGGCTATACAGTAG

Protein sequence:

>DPOGS203814-PA
MAFNIKDLCRLCARKNEFSKDLLDTANHNVLKMVLGYIEISINNNDDLPSKVCINCEEKVTSFQLFVVECLKVQETLKRMVLDNVCNEPLIKLEQRIDNYSYNLPEIKSEVKDELQTDELTSELNSNELLVEMNCDELNNDKYDSDVGSDLSDVALATLKEAKDEKKNKILTEEEANIFKNVLKKRHLTIKDFVKLKCEECENCMKTWTALWCHYYTIHKTRPYLYCICGFVITSKIALYRHVSDHKEEARKYRKMDIGTEKQNEKYSNYNVNDFVNCDKCPRISKSEAAMAKHKLRHIPKSERKFNCNSCDKFFNSKELLKSHAKSHIPIEERKIYRCDICFLKFTTRSSCASHKRIVHDKIKSYVCDLCGYACGTGGELRQHTAIHSEDKPFTCIKCYKSFKTYSNLKTHMDTHEDTSYACHVCNRVLNSRRTLRKHLLVHEDKCQHVCSYCNKAFKRRQTLKVHMHTHTGDKPLSCKWCDERFSYASTLRSHRLRCHPDKMAARYPPYLTLVHMHTHTGDKPLSCKWCDERFSYASTLRSHRLRCHPDKMAARYPPYLTQEGYLKSDAAPVMKGDLEAIQ-