Monarch geneset OGS2.0

DPOGS207533
TranscriptDPOGS207533-TA1608 bp
ProteinDPOGS207533-PA535 aa
Genomic positionDPSCF300177 + 442220-447182
RNAseq coverage473x (Rank: top 26%)
Annotation
HeliconiusHMEL0094310.082.13% 
BombyxBGIBMGA001897-TA0.077.66% 
DrosophilaCG7741-PA6e-11541.19% 
EBI UniRef50UniRef50_E2ARR23e-14147.53%CWF19-like protein 1 n=4 Tax=Neoptera RepID=E2ARR2_CAMFO
NCBI RefSeqXP_396187.28e-13844.49%PREDICTED: similar to CG7741-PA isoform 1 [Apis mellifera]
NCBI nr blastpgi|3071724081e-14047.53%CWF19-like protein 1 [Camponotus floridanus]
NCBI nr blastxgi|3071724082e-13847.63%CWF19-like protein 1 [Camponotus floridanus]
Group
Gene OntologyGO:00038245.9e-09catalytic activity
KEGG pathway 
InterPro domain[310-427] IPR0067681.4e-39Cwf19-like, C-terminal domain-1
[447-526] IPR0067674.2e-18Cwf19-like protein, C-terminal domain-2
[319-389] IPR0111465.9e-09Histidine triad-like motif
[319-390] IPR0111515.5e-06Histidine triad motif
Orthology groupMCL15313 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207533-TA
ATGGCAGATAAACAGAAAACTCTTATTTGTGGGGACGTAGATGGAAACTTCAACATATTGTTTTCCCGCGTTGAATCAATTGTGAAAAAATCCGGAGCTTTCGAGGTACTATTGTGCGTTGGAAACTTTTTTGGTGAAGATAATTCGCAATTGGATGCATATCGCATGAGGACAAGAAAAGTGCCAGTAACAACATATGTTTTCGGACCATCTAATAGTGACCATGTGGAATATTACTGTGAGGAAGGTGCTGAAATTGTTCCAAATGTCATTTACATGGGGAAAAGGGGGATATTCACCACAAGCGCTGATGTTAAAATTGCCTATCTGACGGGTATGTCCCGTCGGGAATTAGGCAAGGAGATACCTTTGTGTACATTTGAACCGAGTGATTGTAGTGCAGTGAGAGATGCATGCTTCAGAGGTACATCTGAATATAGGGGGGTTGACGTCTTAATAACAACCCTATGGCCATCTGGCATACAACAAGATGATTGCCAAAAGGCAGATATTGAGCCAGATAAATTATCCGATCTGATATCGTGGCTCGCAATCCACATAAAGCCGAGGTATCATTTCGTGCCGTCAAAAGAAAAATATTATGAAAGGCAACCTTATAGAAATCAAAGTGTACACCAGGATTACAAAGAGGGCGCCACACGTTTCATCGCATTAGCCCCCGTGGGTAATAAGGTTAAAGAAAAATGGATATACGCGTGTTCATTACAGCCAATAAACAAAATGCGAATGACTGATATATTACAAAGCACTACCGATGAGACCTCTTGCCCCTTTGACCCTGAATTGCTGAAGCAGCATCAGCCAGGGAAGGTTGTGAAGGTTTCTGGTAATGGACAATTCTTCTATAATATGGACGCTCAAGATGATGATAACGGAAAAAGGAAACGGAAAAGTGGTGACAACCCTGAACGGAAAAGGAAAGAATTTGATCCAGATACCTGCTGGTTTTGTCTGTCATCACCATCAGTGGAGAAACATTTAGTGATAAGTGTTGGTAGTCATTGCTACCTCGCTCTACCAAAGGGTCCGTTGACCTCACACCATGTCCTCATACTGCCTATAGCGCATCATCAGTCCGTTACCAAGGCACCGGATGAGGTGATAAAAGAAATTAAGAGATTCAAAGATGCATTGAAGAAGCTGTATTCCTCGATGGACCAGCTGGGAGTGTTCTTTGAGCGAAATTTCAGAACGTCACACATGCAGATACAGTGTGTACCGGTCGGGAAACAGTGTGGAGATCAGTTACTGGAGGTGTTTCAGGACGAGGCCGGCATTAATAGCATTCAGTTAGAGGTGCTGCCGCCTTATACCGACATCGCTCAAGTGTCTCTGCCGGGAGCGCCGTACTTCCACGCGGAACTTCCCTCCGGGGAACAGATATACGCTAAGACACGACAGCATTTCCCATTACAGTTTGGAAGAGATGTACTGTCAAGCCCGCCGATACTTAACTGCGAGGACAAAGCAGACTGGCGACAGTGTCTCCTCAGTAGAGAGGAGGAAGACCAGCTCGTGGCAGACTTCAGACAACAGTTCAGACCATACGACTTCACCGCTGACGATAGTGGTAGCGATAGTGAATGA

Protein sequence:

>DPOGS207533-PA
MADKQKTLICGDVDGNFNILFSRVESIVKKSGAFEVLLCVGNFFGEDNSQLDAYRMRTRKVPVTTYVFGPSNSDHVEYYCEEGAEIVPNVIYMGKRGIFTTSADVKIAYLTGMSRRELGKEIPLCTFEPSDCSAVRDACFRGTSEYRGVDVLITTLWPSGIQQDDCQKADIEPDKLSDLISWLAIHIKPRYHFVPSKEKYYERQPYRNQSVHQDYKEGATRFIALAPVGNKVKEKWIYACSLQPINKMRMTDILQSTTDETSCPFDPELLKQHQPGKVVKVSGNGQFFYNMDAQDDDNGKRKRKSGDNPERKRKEFDPDTCWFCLSSPSVEKHLVISVGSHCYLALPKGPLTSHHVLILPIAHHQSVTKAPDEVIKEIKRFKDALKKLYSSMDQLGVFFERNFRTSHMQIQCVPVGKQCGDQLLEVFQDEAGINSIQLEVLPPYTDIAQVSLPGAPYFHAELPSGEQIYAKTRQHFPLQFGRDVLSSPPILNCEDKADWRQCLLSREEEDQLVADFRQQFRPYDFTADDSGSDSE-