Monarch geneset OGS2.0

DPOGS214489
Transcript  DPOGS214489-TA  1416 bp
Protein  DPOGS214489-PA  471 aa
Genomic position  DPSCF300122  +  91663-106674
RNAseq coverage  118x (Rank: top 58%)
Annotation
Heliconius  HMEL009230  0.0  97.25%
Bombyx  BGIBMGA013390-TA  8e-90  96.11%
Drosophila  rn-PE  6e-94  70.34%
EBI UniRef50  UniRef50_B0XLM0  3e-112  54.91%  Zinc finger protein 36 n=3 Tax=Culicinae RepID=B0XLM0_CULQU
NCBI RefSeq  XP_001655944.1  1e-118  57.71%  hypothetical protein AaeL_AAEL002763 [Aedes aegypti]
NCBI nr blastp  gi|157131805  2e-117  57.71%  hypothetical protein AaeL_AAEL002763 [Aedes aegypti]
NCBI nr blastx  gi|270015655  1e-121  63.29%  hypothetical protein TcasGA2_TC016018 [Tribolium castaneum]
Group
Gene Ontology  GO:0003676  6.4e-14  nucleic acid binding
GO:0008270  3e-05  zinc ion binding
GO:0005622  3e-05  intracellular
KEGG pathway 
InterPro domain  [203-234]  IPR013087  6.4e-14  Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group  MCL14633  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214489-TA
ATGATGGAAGGGCGCACGATCGACTACAGGCCCGACGGTGGAGGTCTTGACTACCACAACTCACCGCTCATGGCAGAAATACCAATAGACAATTACTCCCACATCCACAGAAGTATAGAGCATTTAAGATCAATAGGCATGGCACCACCAATAGATTCGCACAGACATTTAGCAGCTAACCTGACGGATCTTAGAAGATACGAACATCCAGATGATATGCCGGAGATAAAACCGTCCGTTCTGAGGCTGTCGGAGTTCAAGAGCGGCCTCCAGGACATGAGAATCCACAACGACCAAGAAAATGATTTACAGCGTCAAATGACACCCCACGACGATGGCAAAGGATACAGCGCGCCGTCGACGCCTCTATCAGAAAACGGTGGACAAAGCATCCAAGAAGAGAAGGTGTTCGGCTCCAAAGCGGACCTACAGCTGCACACACAGATTCATCTCCGCGAAGCGAAGCCTTACCGATGCACGCAGTGCCCGAAGGCTTTTGCAAACTCGTCGTATCTGGCGCAACACTCCCGCATCCACTTAGGTATCAAACCTTATCGCTGCGAGATCTGCCAGCGTAAATTCACGCAGCTGTCTCATCTTCAGCAACACATCCGCACCCATACCGGCGACAAACCCTATCGATGCACTCAGATCGGATGCACTAAAGCCTTCTCCCAACTCTCTAACCTCCAAAGCCACAGTCGCTGTCACCAAACAGACAAACCGTTCAAATGTAATTCCTGTTACAAATGTTTCACACACGAGAAAGATCTGCTGGAACACATTCCTAAACATAAGGAGTCGAAGCATCTCAAGACGCACATATGCCAATACTGTGGCAAAAGTTACACTCAAGAGACATACTTAAGTAAACATATGAACAAGCACGCGGAGAGAGCAGACAAGCGGCCGCCGATATCTGCATTAGGTCTGAGCGGTCTAAACAGGTCCCTGGCGGCGGCCGCACCCACTGCCGCTCCATTTGCTGACCACCCATACTGGCCGAAGGTCAGCCCGGACTCAGCGGCACACATGTCGGATGAGGGAGGCTATCACCAACAGAGAGAAAGCGACGAGCATCACGAACAACAGTTACAACAACAAAGGGCATTGTTCGCTCAGCACGAGAGCCAAGAAGACCGCATCCAGCCTCCAGTGTCATCAGCTGCGAATTCCGCCTTCACACCGATCAACTCGATGGCCCCTCACTTAAACGGACTGTCCCATCACAGTGCGCTGCCCACGAGACCTTATCTGTACGACCCGCTCCACTTCCAACAAGGGAAGCAGCAGCCGAACTCGTTTCCCAATCAGCTGATCTCCCTCCATCAGATCAGGAACTACGCCCACCAACCTTCCCTGCTGCCCGCGGAACACATCCTCCCCCATACTTTAGCTCACAAAGACAAACAGTAA

Protein sequence:

>DPOGS214489-PA
MMEGRTIDYRPDGGGLDYHNSPLMAEIPIDNYSHIHRSIEHLRSIGMAPPIDSHRHLAANLTDLRRYEHPDDMPEIKPSVLRLSEFKSGLQDMRIHNDQENDLQRQMTPHDDGKGYSAPSTPLSENGGQSIQEEKVFGSKADLQLHTQIHLREAKPYRCTQCPKAFANSSYLAQHSRIHLGIKPYRCEICQRKFTQLSHLQQHIRTHTGDKPYRCTQIGCTKAFSQLSNLQSHSRCHQTDKPFKCNSCYKCFTHEKDLLEHIPKHKESKHLKTHICQYCGKSYTQETYLSKHMNKHAERADKRPPISALGLSGLNRSLAAAAPTAAPFADHPYWPKVSPDSAAHMSDEGGYHQQRESDEHHEQQLQQQRALFAQHESQEDRIQPPVSSAANSAFTPINSMAPHLNGLSHHSALPTRPYLYDPLHFQQGKQQPNSFPNQLISLHQIRNYAHQPSLLPAEHILPHTLAHKDKQ-