Monarch geneset OGS2.0

DPOGS202236
TranscriptDPOGS202236-TA2013 bp
ProteinDPOGS202236-PA670 aa
Genomic positionDPSCF300149 + 602406-609498
RNAseq coverage379x (Rank: top 32%)
Annotation
HeliconiusHMEL0091715e-6946.18% 
BombyxBGIBMGA013535-TA7e-15449.10% 
DrosophilaCG10414-PA6e-6441.82% 
EBI UniRef50UniRef50_F4WE158e-12240.43%Cysteine-rich protein 2-binding protein n=7 Tax=Formicidae RepID=F4WE15_ACREC
NCBI RefSeqXP_394546.31e-12941.04%PREDICTED: similar to cysteine and glycine-rich protein 2 binding protein [Apis mellifera]
NCBI nr blastpgi|1107565253e-12841.04%PREDICTED: cysteine-rich protein 2-binding protein-like [Apis mellifera]
NCBI nr blastxgi|3504212271e-13041.78%PREDICTED: cysteine-rich protein 2-binding protein-like [Bombus impatiens]
Group
Gene OntologyGO:00081524.6e-09metabolic process
GO:00080804.6e-09N-acetyltransferase activity
GO:00055156e-05protein binding
GO:00082706e-05zinc ion binding
KEGG pathway 
InterPro domain[526-670] IPR0161812.4e-18Acyl-CoA N-acyltransferase
[11-99] IPR0110111.2e-09Zinc finger, FYVE/PHD-type
[571-640] IPR0001824.6e-09GCN5-related N-acetyltransferase (GNAT) domain
Orthology groupMCL16588 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202236-TA
ATGAACGAAACTTCTACCATTATTGCTACTACAAGTACAATAGACACAGATGCCGGTAATACAGAAACGTTAACGCCTATAGTATGCAAATATTGTCAACAACCAGAAAACAAACAATACAAGCCATGCTTAGTTTGCGAGAAATGCAAATGTGCAGTGCATTTAATTTGTTTACGACGTCCTGGTACACCTGGAGATATAGACGGAGATGTTTTCTTTGACTTCACATGCTTAGAATGTTCACCCACTAAGGAAGAGCTGTTTGAACGCAACAAATTCCCTTGGGTCCATGTACTGTTGTTAACATTAAATCACCTGAGCGAGCACATGCAGGGTATAAGCAACAGCGGCTACTTCCATTACAAAACACACATCTGTTCATTCGTCGACAGAAATTGGACCCTGTTGTTTGGGACTCGCAATCGTAAGAAGAACTGGATAGGAACAATATCCGGAGCATTGTCGGTGTACAGCAATACATTCTTCCGATCGGGCTCTGTTGCCCTCGGAGAGTCGGGATGGTGGAAACTTATGCATGGTTTTTCCCCCGCGGTCGCTGCACATATCTTACAAGAAATGAATAAAGAAAAAACAAAAGGCCAGCCTCGCAATCAGTTCACATTGGACTCCGTGTTGTTTATTAGTAAAATCAAATTAATGGGTTACGGTGTCTATCTAAACGTGGAAATGAACCCAAATAAGAGGAGGAAGTTGGACTCTGACGCTTCGGAGAGCTCGGAGATGAAGACTTATGAGGGTGAAGAAAGACATGAATACCCCGAGGAACCTGAGTCCCTGACGTGTGTGACGGCGAAGGACAACTCCGATTCCTTTTGGGAGTTCGACTTCGCGCCCGCCAGTCGCGTTCAGAGCCGCGTGTCCTCGAGTAGACCCTCCGTGTATTACGACTCGGACACCAACTCTCACTCCGACGGAGACAAGTCGCAACAAACTAAAAAGACCTCAAAAACTAAGAAAATAGTTGAAGACAAAGAAATCTTCAGGAAGGAATCTCTGTTTTCCACGCTTCCAAACTCGATAGACATGCCCTGGGTGGAGAAACATTCCAGCGAATCAGACAAGCGGAAGCTGGTCCCCTTGACGGAATACGAAGAAGTCCAGCTATTGAAGACTGTGGAAAACCTCATACCGAGAGTCAAAGACCCCAGCAAAAAGGCTGAATTGTACAGGCTGAAGGCAAAACTGTCACTACGGAGGTTGAAACGACACCAACATAAACCAGTCTTCGATTTGGACAAGGCGGTGAAGGTCTTGGGCGGGTATGTGACGGAGGACCACACAGTCGCAATGAACGCGGAGTGCATACTGGACAGATTCCAACGATCGTACCTCATAGACAACCTCTGCGGCACTATAGCGAGTACTAACTACGGCACGATGTTGTTGTCGCATATAGAACCGACGACCTTCAGATCGCCTTACTCCGGCGCCATACTGAAACCCTTCATCAGGAGGGATGCCACGTCCAGACCCCTCTGGCTGAAGGTTACTGAAGAACTCCTCATAAAGACGCATAGGCACGTCCCCGAGTACCGCGTCCCTCCCCGGCCCGTCCTCAGCTACTCCTACGTCCGTCCGCAACACATCGCGGCCGTCAACAGTCTCTGCGCCCAGTTCTTCTGGCCGGGGATAGACCTGACGGAAGCGCTCCAGTACCCTGAGTTCAGCTGCGTGGTGACGTTCGGGCGGCTGGTGGTGGCGTGCGCGTTCCTCGTCCCCGACGTGCGCCAGAGCGAGGCCTACATCAGCTTCATACTCACGAGGCCTGAATGGAGAAACGCTGGCATCGCCACCTTCATGATGTACCATCTGCTGCAGACTTGCACAGACAAGGATGTGACACTGCACGTGTCTCCCACGAACCCCGCTGTATTCCTCTACCAAAAATTTGGTTTTAAAGTTGAAGAGTTGATCCAAGATTTCTACGAAAAATACTACGACATAGACCATAAAGGTTGCCGGCATGCTCTGTTTCTAAGACTCGTTCGATAA

Protein sequence:

>DPOGS202236-PA
MNETSTIIATTSTIDTDAGNTETLTPIVCKYCQQPENKQYKPCLVCEKCKCAVHLICLRRPGTPGDIDGDVFFDFTCLECSPTKEELFERNKFPWVHVLLLTLNHLSEHMQGISNSGYFHYKTHICSFVDRNWTLLFGTRNRKKNWIGTISGALSVYSNTFFRSGSVALGESGWWKLMHGFSPAVAAHILQEMNKEKTKGQPRNQFTLDSVLFISKIKLMGYGVYLNVEMNPNKRRKLDSDASESSEMKTYEGEERHEYPEEPESLTCVTAKDNSDSFWEFDFAPASRVQSRVSSSRPSVYYDSDTNSHSDGDKSQQTKKTSKTKKIVEDKEIFRKESLFSTLPNSIDMPWVEKHSSESDKRKLVPLTEYEEVQLLKTVENLIPRVKDPSKKAELYRLKAKLSLRRLKRHQHKPVFDLDKAVKVLGGYVTEDHTVAMNAECILDRFQRSYLIDNLCGTIASTNYGTMLLSHIEPTTFRSPYSGAILKPFIRRDATSRPLWLKVTEELLIKTHRHVPEYRVPPRPVLSYSYVRPQHIAAVNSLCAQFFWPGIDLTEALQYPEFSCVVTFGRLVVACAFLVPDVRQSEAYISFILTRPEWRNAGIATFMMYHLLQTCTDKDVTLHVSPTNPAVFLYQKFGFKVEELIQDFYEKYYDIDHKGCRHALFLRLVR-