Monarch geneset OGS2.0

DPOGS214851
Transcript: DPOGS214851-TA  1311 bp
Protein: DPOGS214851-PA  436 aa
Genomic position: DPSCF300091 - 124054-127341
RNAseq coverage: 110x (Rank: top 59%)
Annotation
Heliconius: HMEL015017  2e-159  63.45%
Bombyx: BGIBMGA010074-TA  1e-92  47.83%
Drosophila: nopo-PA  7e-44  27.71%
EBI UniRef50: UniRef50_B0WCA9  6e-45  30.46%  Putative uncharacterized protein n=1 Tax=Culex quinquefasciatus RepID=B0WCA9_CULQU
NCBI RefSeq: XP_001959242.1  6e-46  30.02%  GF12143 [Drosophila ananassae]
NCBI nr blastp: gi|194753898  1e-44  30.02%  GF12143 [Drosophila ananassae]
NCBI nr blastx: gi|170036989  1e-50  30.46%  conserved hypothetical protein [Culex quinquefasciatus]
Group:
Gene Ontology: GO:0005515  9.2e-08  protein binding
Gene Ontology: GO:0008270  9.2e-08  zinc ion binding
KEGG pathway:
InterPro domain: [5-49]  IPR013083  1.2e-12  Zinc finger, RING/FYVE/PHD-type
InterPro domain: [5-45]  IPR001841  9.2e-08  Zinc finger, RING-type
Orthology group: MCL11169  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS214851-TA
ATGCATATCCTTTGCACTATATGCAGCGACATCGTAAATCAAGCGGAAAATATTTATGTGACAAAATGTGGACATGTTTTTCACTATAACTGTTTATCAAAATGGATTGCACGATCGAAATCATGCCCACAATGTCGCAATAAGGTCACAGACAAATGTATGTTTCGTTTTTATCCAACAATATCAAATGAAGCTACAAATGAAGACGCAGCCACATTACAATCACGTCTTGACGATGCACAGTTACAACTCCGCCAACAGAAAGCTAGTTGCAAAGAACATGAAGAAAAAATTTCAGCTACAGAGGCCGAAATCAAAAAGAATTTGGCATTATTGAAAGCATGTGAAAAAAAACTTGAAAGCCGAGACACAGCTATAGCAGCGCTCAAGGAACAGCTGCAGTATGTGAAGATACAGAACAATGAAACCAATAGGTTGAGGGAAGAAAATGAGGTTTTGAAGAAAAACATGCAGACATTGAACGGTTTACAGAAAGTTTTGAATGCTACCAGCGAAGAAGTTGAAAAAATGCTCGAAGGTTATTCTGATATAAGAATGGTAGCTACTTTTGCCACTGCCTTGAAACGGGCACTCTGTGAATCGGAAGATAAGAAAAACGAATCTAGGGATCAGATACAAATGCTCAAACAACAGCTATCTGCTGAAAAACGTTATGTTGCTGAAATCCAGGCTAAGTTACTTTCAACGGAGGAACTACTAAGGGTAACTAAACGTAAATACTTTAGTTTGAAACACAAACGTAAGGCGGATTCGTTAGATTCAACAGATTCTCTAGACATGGCGGTGAAGCAAATGAAGCCAGATCAAGACCTCAACGACGCGGTCATAGTTCAAGATAGTGACACAAGCAACACCAGTATTAATACAATGGTGAATAGAATAGAAAATTCAGAATCACCGTACTTAAGTTTAAAACAAAGCAGCCTGGCGCTGACCGCCTTACAAAGACATCCCACACATCCGCTACCTGATAGGAACCTTAAGCCATCAGAACTAGCTCTGTTTAATTCAGCTCGGAACGCGATCACCAAGAAACCAGATATACAGAGGACAAGCATCTTCCACCAGAAGGAACCGATTAAGATACAATTATCTTCAGAGAATGACCCAAATATGTCATTACTAAACATATCCTATGACGGCCTGGGGGGTCATTCTAAACATGACACTTTCCCCTCACCGAGACAATCAATGAAAAGTTGCATTCCAAAACTATCAGCCAAACACAAATTAAAACGACCAAATCCGATAGGAAGCAGAGATATAAGCAAAATGCTGAAAAAAACTTAA

Protein sequence:

>DPOGS214851-PA
MHILCTICSDIVNQAENIYVTKCGHVFHYNCLSKWIARSKSCPQCRNKVTDKCMFRFYPTISNEATNEDAATLQSRLDDAQLQLRQQKASCKEHEEKISATEAEIKKNLALLKACEKKLESRDTAIAALKEQLQYVKIQNNETNRLREENEVLKKNMQTLNGLQKVLNATSEEVEKMLEGYSDIRMVATFATALKRALCESEDKKNESRDQIQMLKQQLSAEKRYVAEIQAKLLSTEELLRVTKRKYFSLKHKRKADSLDSTDSLDMAVKQMKPDQDLNDAVIVQDSDTSNTSINTMVNRIENSESPYLSLKQSSLALTALQRHPTHPLPDRNLKPSELALFNSARNAITKKPDIQRTSIFHQKEPIKIQLSSENDPNMSLLNISYDGLGGHSKHDTFPSPRQSMKSCIPKLSAKHKLKRPNPIGSRDISKMLKKT-
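
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code and writes the stop codon as "-", as in the protein FASTA above. The function name `translate` and the two short test fragments (the first 18 and last 24 bases of DPOGS214851-TA) are chosen here for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# record uses the standard nuclear code, which matches the sequences shown).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon; stops become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 18 and last 24 bases of the transcript listed above.
head = "ATGCATATCCTTTGCACT"
tail = "AGCAAAATGCTGAAAAAAACTTAA"

print(translate(head))  # MHILCT
print(translate(tail))  # SKMLKKT-

# Length check: 1311 bp is 437 codons, i.e. 436 residues plus one stop codon.
print(1311 // 3 - 1)  # 436
```

Passing the full nucleotide sequence to `translate` in place of the fragments should reproduce the complete protein record, including the trailing "-".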