Monarch geneset OGS2.0

DPOGS206051
Transcript: DPOGS206051-TA, 1152 bp
Protein: DPOGS206051-PA, 383 aa
Genomic position: DPSCF300028 - 1037226-1040111
RNAseq coverage: 158x (Rank: top 52%)
Annotation
Heliconius       HMEL002826        0.0     76.76%
Bombyx           BGIBMGA000728-TA  5e-118  76.68%
Drosophila       CG5880-PA         2e-109  51.94%
EBI UniRef50     UniRef50_Q9VB73   4e-107  51.94%  CG5880 n=25 Tax=Endopterygota RepID=Q9VB73_DROME
NCBI RefSeq      XP_001656662.1    6e-111  52.54%  hypothetical protein AaeL_AAEL003238 [Aedes aegypti]
NCBI nr blastp   gi|157135444      1e-109  52.54%  hypothetical protein AaeL_AAEL003238 [Aedes aegypti]
NCBI nr blastx   gi|195108913      3e-113  56.42%  GI23298 [Drosophila mojavensis]
Group
Gene Ontology    GO:0008270           5.5e-25  zinc ion binding
KEGG pathway     cne:CNA04190         1e-13
                 K00680 (E2.3.1.-) maps->
                     Benzoate degradation via CoA ligation
                     Limonene and pinene degradation
                     Ethylbenzene degradation
                     Tyrosine metabolism
                     1- and 2-Methylnaphthalene degradation
InterPro domain  [141-194] IPR001594  5.5e-25  Zinc finger, DHHC-type, palmitoyltransferase
Orthology group  MCL11932             Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206051-TA
ATGGTAGTTTTACAATGGCGTTTTCAATGGAGGGGACTTTTTGGAAACATCAAAGCATACATAATATGGAAGTTTGCCCAAGCATTACTAACGTTTCACGTCTTAACATACAACACGCACATGAGTCAGAGCTACGCAATAGATTGCCTTTTGGAACCAGTTTTTTGGTTTGTCGATAACTTTGCGGGATACTTAGGAAGGGTATTTGTATTTTGTGTGAGTATACTTACTAGCGCAGTGGTGTTCATCGCTTACTGGGTGGGACTGCCGTATTGGTGGGAGAAGAGCCAAAGTGTGACAATTGCTTTAGTTATATTTGGAAACTGGATTCTGCTCAATATTATCTTTCATTACTATATGGGAGTGAAAACCTCACCTGGATACGCACCTCATGGTTCTTTAATATCAGAAGCTGCGAGTATATGCAAGAAGTGTATATCTCCGAAGCCTCCCAGAACCCATCACTGTTCGGTGTGCGACCGCTGCATTCTTGGGATGGATCATCACTGTCCCTGGCTAAATAACTGCGTGGGCTTCTACAATGCAAGATATTTTTATTTATACATGGTATATATGGTGATGGGCGTAACGTTCTTGATTGTGGCTGGTATTGATTTGGGCTATCAAGTCCTTTGGGTGAATGATACAGGTGGTTTAATGTCTGAAAATGATCCAGATCTCATTGGCCATCCAGTTCGCATGAACCAGAGCGGAGTATTGGTCCCAGTGCAAGTTATAGCCGAATATGATTCTGTGAATTTCCCTCGCAACCATATTCTTCCGACTCCAGTTATAACCGAGGCTCAGAGGATAACAGCTAACTCATTGAAGAGGAAAGCAGTTATGTTCATGGCGATGATATGTTTGTCGGTGTTGTTTGCCCTTGGTGCACTAGTCGTTATGCACGGGAAGAACATTAGCAGAGGCGAAACTAGCATAGAAGCTCATATAAATGATAGGCTCAGAAGGACACATAAAAATAAATTCATTAATCCATACAATTTTGGGAGGAAGAAGAATTGGAAGTTGTTTCTTGGTTTGACTCAAGGTAGGAAATTATGGAGACATGTTTTGTTGCCATCGAGTCATGCACCCACGGGTACTGGGCTTACGTGGCACACCATACACAATTCCTTAGAAGATTGGCCCTGA

Protein sequence:

>DPOGS206051-PA
MVVLQWRFQWRGLFGNIKAYIIWKFAQALLTFHVLTYNTHMSQSYAIDCLLEPVFWFVDNFAGYLGRVFVFCVSILTSAVVFIAYWVGLPYWWEKSQSVTIALVIFGNWILLNIIFHYYMGVKTSPGYAPHGSLISEAASICKKCISPKPPRTHHCSVCDRCILGMDHHCPWLNNCVGFYNARYFYLYMVYMVMGVTFLIVAGIDLGYQVLWVNDTGGLMSENDPDLIGHPVRMNQSGVLVPVQVIAEYDSVNFPRNHILPTPVITEAQRITANSLKRKAVMFMAMICLSVLFALGALVVMHGKNISRGETSIEAHINDRLRRTHKNKFINPYNFGRKKNWKLFLGLTQGRKLWRHVLLPSSHAPTGTGLTWHTIHNSLEDWP-
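
The transcript and protein entries in this record can be cross-checked against each other: 1152 bp corresponds to 384 codons, that is, 383 residues plus one stop codon, which appears to be what the trailing "-" in the protein sequence denotes. The following is a minimal Python sketch of that check, assuming the standard genetic code. The function names `translate` and `expected_protein_length` are illustrative and are not part of the original record.

```python
# Build the standard genetic code table (an assumption; the record does not
# state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1, writing stops as stop_symbol."""
    cds = cds.upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in exactly one stop codon."""
    return cds_length_bp // 3 - 1


# First and last codons of DPOGS206051-TA, copied from the record above.
start = translate("ATGGTAGTTTTACAATGG")
end = translate("GAAGATTGGCCCTGA")
print(start, end, expected_protein_length(1152))
```

Applied to the full DPOGS206051-TA sequence, the same `translate` call can be compared with the DPOGS206051-PA entry to confirm that the two sequences agree.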