Monarch geneset OGS2.0

DPOGS216078
TranscriptDPOGS216078-TA1518 bp
ProteinDPOGS216078-PA505 aa
Genomic positionDPSCF300067 + 613051-618162
RNAseq coverage1060x (Rank: top 12%)
Annotation
HeliconiusHMEL0089470.069.05% 
BombyxBGIBMGA008875-TA0.062.57% 
DrosophilaCG8839-PE8e-14951.40% 
EBI UniRef50UniRef50_Q7K2E11e-14651.40%CG8839, isoform A n=23 Tax=Endopterygota RepID=Q7K2E1_DROME
NCBI RefSeqXP_002015801.12e-15251.20%GL11255 [Drosophila persimilis]
NCBI nr blastpgi|1951497143e-15151.20%GL11255 [Drosophila persimilis]
NCBI nr blastxgi|1951497143e-14551.30%GL11255 [Drosophila persimilis]
Group
Gene OntologyGO:00168841.4e-207carbon-nitrogen ligase activity, with glutamine as amido-N-donor
KEGG pathwaydpo:Dpse_GA206784e-82 
 K01426 (E3.5.1.4, amiE)maps-> Styrene degradation
    Benzoate degradation via CoA ligation
    Arginine and proline metabolism
    Tryptophan metabolism
    Phenylalanine metabolism
    Cyanoamino acid metabolism
InterPro domain[23-504] IPR0001201.4e-207Amidase
[19-501] IPR0236317e-117Amidase signature domain
Orthology groupMCL14832 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216078-TA
ATGCTAGTGGATGTAATAACCAGTGTTTTTTTTAAATTGTATTACGGAACAAATACAAAGAAGTTACCACCCATAAAAGATGATATTTTGAAACAACCGGCTGTGGAGGTGGCTAGAAGAATAAGAAATAAAGAGATCAGCAGTGTGGAAGTATTGAAGGCGTGCATGCAAAGAATAAGTGACACTAATTCCCAAGTGAATTGTTTTGTAGAAAATCGCTATGATTTAGCTTTACAAGAAGCCAAAGAAGCTGATAAACTTGTTCAAAGCGGTGCAAAGACTATACAACAATTGGAAAAAGAGAAACCATTTTTAGGCGTCCCTTTCACGACAAAGGATTGCATAGCTGTTAAAGGACTACATCATACCGCGGGAGTAGATTTGAGAAGAGACAAGATAGCCGAAACTGATGCAGATGTTATTAGAATTCTAAGAGAAAATGGTGCAATAATCATAGGTCTTACAAATGTTCCCGAACTTTGCATGTGGTGGGAGACCCACAACCACATATACGGAAGAACGAGCAATCCCTACGACACCACAAGAATAGTGGGAGGTTCATCTGGAGGCGAAGGATGCATTCAAGCTCTAGGAGGAAGCTGCTTTGGGATTGGTTCTGACATAGGAGGATCTATTCGAATGCCTGCATATTTTAACGGGATATTTGGTCACAAACCGTCAAGGCTGATAGTTTCCAACGTTGGACAATATCCTGAGGAACCCACAGATCTACATAAATCATTTTTATGCATTGGACCCATGACGAGATTCGCTGCAGATTTAAAACCGATCCTTAAAATTATATCGGGCGAAAACTGTGCAAAGCTTAATTTAGATAAACCTATTAATTTGAAAAACTTAAAGATATTCTATCAAATCAACAATGGTGCACCATTAACGGACAAAGTTGATAAAGACATAGTGACAGCACTAGAAAAAGTCGTAGAGTTCTTTAACAAGAAACATAACATAGTCGCCGAAGAAAAGAAAATTGAATGGCTTCAACGTTCCATTCCAATCTGGATGGAAACTATGAAAGGAAAATGTCCTTTCGGAAAATACATCATAGAAGATTATAGTATATTTGCTGTATTTAAAGAGATTTTCAAAAACATTGTAGGGCTTTCAGGGAACACTCTCATTGCGTTGTTTACATCTTTAGTAGATCGTGATGTCCTAAATCCAGAATGTAAGAGATACCAATATTATTTAAAGGTCCGTCAAGAGTTAGAAGATATTTTTAAAAATATGCTCGGTGAGGATGGAATATTTCTGTACCCAACACATCCAACACCAGCCCCTTATCACAACCAGCCCTTGGTTAAACCGATGAATTTTATATATACAGCTATAATCAACAGTCTTGGCCTCCCAGCGACAACAGTTCCTTTAGGCTTAAGTAGAGATGGACTTCCCATTGGCATACAAGTTATAGCTAACCATAATAATGACAGACTCTGCTTGGCGGTGGCGGAAGAGCTTGAAAAGGCATTTGGTGGATGGATAGAACCAAAATAA

Protein sequence:

>DPOGS216078-PA
MLVDVITSVFFKLYYGTNTKKLPPIKDDILKQPAVEVARRIRNKEISSVEVLKACMQRISDTNSQVNCFVENRYDLALQEAKEADKLVQSGAKTIQQLEKEKPFLGVPFTTKDCIAVKGLHHTAGVDLRRDKIAETDADVIRILRENGAIIIGLTNVPELCMWWETHNHIYGRTSNPYDTTRIVGGSSGGEGCIQALGGSCFGIGSDIGGSIRMPAYFNGIFGHKPSRLIVSNVGQYPEEPTDLHKSFLCIGPMTRFAADLKPILKIISGENCAKLNLDKPINLKNLKIFYQINNGAPLTDKVDKDIVTALEKVVEFFNKKHNIVAEEKKIEWLQRSIPIWMETMKGKCPFGKYIIEDYSIFAVFKEIFKNIVGLSGNTLIALFTSLVDRDVLNPECKRYQYYLKVRQELEDIFKNMLGEDGIFLYPTHPTPAPYHNQPLVKPMNFIYTAIINSLGLPATTVPLGLSRDGLPIGIQVIANHNNDRLCLAVAEELEKAFGGWIEPK-