Monarch geneset OGS2.0

DPOGS215968
Transcript: DPOGS215968-TA (1842 bp)
Protein: DPOGS215968-PA (613 aa)
Genomic position: DPSCF300078 - 687227-691526
RNAseq coverage: 334x (Rank: top 35%)
Annotation
Heliconius: HMEL005885  0.0  81.31%
Bombyx: BGIBMGA001083-TA  0.0  72.27%
Drosophila: Hip14-PA  0.0  60.70%
EBI UniRef50: UniRef50_Q9VUW9  0.0  60.70%  Huntingtin-interacting protein 14 n=19 Tax=Neoptera RepID=Q9VUW9_DROME
NCBI RefSeq: XP_002069180.1  0.0  61.00%  GK24504 [Drosophila willistoni]
NCBI nr blastp: gi|195442892  0.0  61.00%  GK24504 [Drosophila willistoni]
NCBI nr blastx: gi|118793277  0.0  62.71%  AGAP011732-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology: GO:0008270  6e-25  zinc ion binding
KEGG pathway: cne:CNA04190  5e-56
 K00680 (E2.3.1.-) maps-> Benzoate degradation via CoA ligation
    Limonene and pinene degradation
    Ethylbenzene degradation
    Tyrosine metabolism
    1- and 2-Methylnaphthalene degradation
InterPro domain: [45-253] IPR020683  1.9e-49  Ankyrin repeat-containing domain
    [425-478] IPR001594  6e-25  Zinc finger, DHHC-type, palmitoyltransferase
Orthology group: MCL14305  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
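
The genomic position given in the record above can be split into scaffold, start, and end to get the size of the gene's footprint on the scaffold. The following is a minimal sketch, not part of the database: the helper names `parse_position` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record does not state.

```python
import re


def parse_position(text):
    """Split a 'SCAFFOLD - START-END' string into (scaffold, start, end)."""
    match = re.fullmatch(r"\s*(\S+)\s+-\s+(\d+)-(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end = parse_position("DPSCF300078 - 687227-691526")
print(scaffold, span_length(start, end))  # prints: DPSCF300078 4300
```

Under that assumption the gene spans 4300 bp of scaffold DPSCF300078, compared with the 1842 bp transcript.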

Nucleotide sequence:

>DPOGS215968-TA
ATGTACGATAGTACGTGTGGCGCTGCCGCCACAGGACAGTGTGGTAAATCTCAGCGCGAAGGGGACGGACCTCCGACGCGAGAGCCTCCGCCGGCTCCTCTTGAACGAGATTACAGCGGTTTTGATATCGTAAAGGCAACCCAATATGGTGCCTTCTCCAGAGTCAAGGAGCTGGTTGAAGCCGGCTGGGACGTGAACCAACCAGACCATGAAACAGTCACTCTTTTACACTGGGCTGCTATCAATAACAGACGCGAGATCATTGAGTATCTTCTATCAAAGGGAGCGAAGGTAGATGCCATCGGTGGCGAGCTCCAATCGACGCCGCTCCACTGGTCCACACGGCAAGGTCATTTGGAAGCCACTGTCGCTTTGATTAGGGCGGGCGGAGATCCGTCTCTTAGAGATGCCGAAGGTTGCGGCTGCCTGCATTTGGCAGCGCAATTCGGTCATACAGCTGTCGTGGCTTACTTGGTGGCACGAGGGGTGCCTCCGGACGCGCCCGACGCGGGCGGGATGACGCCGCTCATGTGGGCCAGCTGGAAGGTTTGCGCTGTGGACCCGACCAGGCTTCTGCTAACCCTGGGCGCCTCCCCTCAACCAGCGGACCATGCGCACGGCAACACGGCGCTTCATTGGGCAATCCTGGCCAGAAACGCCACCGCCATATCTACACTTATCCTCTATGGAAATGCAAGTTTGGATGTACCCAACCTAAGAGGTGTGACGCCTTTAACTATGTTGAAGAGTAACACTGATTCGCTGTGGGTCGGCGCCAAAGTAGCCGATAAAATAAAGGAACAAACCGCTGCCTCTTCAAAGAGAAATATCTTCCGCAGATTAGCATATGACAAAAAGTTTCGATGGTGGTGCGTCATCAGTATACCATTCCTGGCTTTTTACGCTACCGGCCTGGTCCTCGAGATGGACGCCCTCTACTTACTGAAGGGTTTCCTGCTCGTCTGTTTTTACGCACTACTACACTTCTTCACCAATGCACTATTTGACGACGATCTCAAAAATATTTTCCCGTTAAGTGTATACCTGGCAACAAAGGTGTGGTTCTATATAACGTGGGTGGTGTTGATAGCGCCGGTGGTAGGTGGTGGTGAGACGGTGGCCTTCCTTCTGTGCTCGATGTCTCTGTGGTACACATTCCTTCGTTCGTGGCGTAGCGATCCAGGCGTCATCTGCGCCTCGCGAGCTGAAAAGATGAGAACTATAATCGAGTTATCTGAGCGTGGCGGCGGCGGTGGGTTCGAGCCGGCTCGTTTCTGTTCCGCGTGTCTTCTCCGTCGTCCGCTTCGCTCTAAGCACTGTTCGGTCTGTAACCGCTGCGTCGCCAAATTCGACCACCACTGCCCGTGGGTTGCTAATTGCATCGGTGCGAAGAACCACCACTACTTCATCGGTTTCCTGGCCAGCCTGCTGGTGATGTGCGCGTGGATGCTGTGGGGCGCGGCTCAGTACTTCACCTCAGTGTGCGGCGCGGCGTCGGGCGGGACGGTCGTGCTGGTGTGGCTCCAGTGCAGTCCGTGGCTCGCCTGGGTCTCACTGAACGCCGCCTTCCACCTGTTCTGGGTGACCGTGCTGTCGTGCTGCCAGCTGTACCTCGTGGTGTGCCTCGGCATGACCACCAACGAGCAGTTGAACCGCGGCCGCTACAGGCACTTCCAGGCTCGCGGCGGTCGCTCGCCCTTCACGCGCGGTCCCCTCAACAACCTGGCGGACTTCTTCCAGTGCCGGCTGTGCGGCCTGCTGGCCCCTCGGCCTCGCGACTGGTCCGCCGCCGGCCTCGACGACACCGAGCCCATGTTGCCGCGCGACACTCACTACGTCTGA

Protein sequence:

>DPOGS215968-PA
MYDSTCGAAATGQCGKSQREGDGPPTREPPPAPLERDYSGFDIVKATQYGAFSRVKELVEAGWDVNQPDHETVTLLHWAAINNRREIIEYLLSKGAKVDAIGGELQSTPLHWSTRQGHLEATVALIRAGGDPSLRDAEGCGCLHLAAQFGHTAVVAYLVARGVPPDAPDAGGMTPLMWASWKVCAVDPTRLLLTLGASPQPADHAHGNTALHWAILARNATAISTLILYGNASLDVPNLRGVTPLTMLKSNTDSLWVGAKVADKIKEQTAASSKRNIFRRLAYDKKFRWWCVISIPFLAFYATGLVLEMDALYLLKGFLLVCFYALLHFFTNALFDDDLKNIFPLSVYLATKVWFYITWVVLIAPVVGGGETVAFLLCSMSLWYTFLRSWRSDPGVICASRAEKMRTIIELSERGGGGGFEPARFCSACLLRRPLRSKHCSVCNRCVAKFDHHCPWVANCIGAKNHHYFIGFLASLLVMCAWMLWGAAQYFTSVCGAASGGTVVLVWLQCSPWLAWVSLNAAFHLFWVTVLSCCQLYLVVCLGMTTNEQLNRGRYRHFQARGGRSPFTRGPLNNLADFFQCRLCGLLAPRPRDWSAAGLDDTEPMLPRDTHYV-
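
The transcript and protein records above can be cross-checked by translating the nucleotide sequence codon by codon. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing `-` in the protein record marks the stop codon. The function name `translate` and its `stop_symbol` parameter are hypothetical, not part of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1; stops become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six and last four codons of DPOGS215968-TA, copied from the record.
print(translate("ATGTACGATAGTACGTGT"))  # prints: MYDSTC
print(translate("CACTACGTCTGA"))        # prints: HYV-

# Length check: 1842 bp gives 614 codons, i.e. 613 residues plus the stop.
print(1842 // 3 - 1)                    # prints: 613
```

Passing the full sequence from the `>DPOGS215968-TA` entry to `translate` and comparing the result with the `>DPOGS215968-PA` entry extends the same check to the whole record.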