Monarch geneset OGS2.0

DPOGS207739
TranscriptDPOGS207739-TA1932 bp
ProteinDPOGS207739-PA643 aa
Genomic positionDPSCF300042 - 845359-852128
RNAseq coverage715x (Rank: top 18%)
Annotation
HeliconiusHMEL0073890.063.78% 
BombyxBGIBMGA005300-TA0.069.44% 
DrosophilaCG4829-PB4e-13045.77% 
EBI UniRef50UniRef50_Q7Q4V87e-16351.26%AGAP000853-PA (Fragment) n=1 Tax=Anopheles gambiae RepID=Q7Q4V8_ANOGA
NCBI RefSeqXP_974407.12e-17752.80%PREDICTED: similar to gamma glutamyl transpeptidase [Tribolium castaneum]
NCBI nr blastpgi|910891894e-17652.80%PREDICTED: similar to gamma glutamyl transpeptidase [Tribolium castaneum]
NCBI nr blastxgi|910891895e-17249.92%PREDICTED: similar to gamma glutamyl transpeptidase [Tribolium castaneum]
Group
Gene OntologyGO:00038404.8e-256gamma-glutamyltransferase activity
KEGG pathwaytca:6632586e-177 
 K00681 (ggt)maps-> Arachidonic acid metabolism
    Glutathione metabolism
    Taurine and hypotaurine metabolism
    Cyanoamino acid metabolism
    Selenoamino acid metabolism
InterPro domain[42-641] IPR0001014.8e-256Gamma-glutamyltranspeptidase
Orthology groupMCL16333 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207739-TA
ATGGGTTCAATACGAAGCAGTTTGCACCGGCGTAATTTTATAAATAATCAAATGGATGATATGCCCATGGATACAATAGGCCTGGTTTCACCAATCAATGCATTCTTACACGGCAATTTCAGTAATGACGATAGTGATTACCCCGAAAAACCATTGCGATCGAATACGAAGCTAATAATATTCAGTCTGGTGATGTTAGCATTTGTGTCTGCGCTAAGCGGGTACCTGATCGGCCAAGCGGACTATAATCCATCAAAATTCACTGAGCCCGAGGACCCTGAACAGCCGTTAGCCCCGTCAGCATCATGGCTTCACGTCTTCCAGAAGGCAGCTGTTTGCACAGATGCACCACATTGCTCAGGAATCGGCAGGGCCATTCTCTCAATAAACGGCTCAGCAGTGGACGCGGCCATTGCGGCTATGTTCTGCAATGGTTTACTCAACCAGCAGAGCATGGGGATCGGCGGAGGATTCTTCATGACGGTTTATATAAAGGAAGAAGAGAAGGCATACTCCGTGATCGCGAGGGAGAAAGCCCCGGCTGCGGCAAAAAGAGACATGTTCGGCGGAAGCTCATATGAAGCCTCCAAAGGTAGCCTTTCTATTGGGGTGCCGGGTGAGGTGCGTGGCATGTGGGAAGCCCACAAACGTTGGGGGAGACTGCCGTGGGAGAAGCTGATCAAACCAACTCTAGAGTTCTGCAAATACGGCTTTACCATCTCCAAGGCTATGTACGACGGAATAATGAGCGCGAAGTACATCAAGAATGATCCCACTTTAAGGAGAATGTACTGGGATTCGTCAAAGAATGCGTACTACCGTCCGGGAACCCTCGTCACACCCAGCCCAGCGCTCTGTAGAACTCTAACCAGGATCGCAAAAAAGGGTGGTGACGAAATGTACAACGGATCCCTGGCCGCCGACCTCGCCGATGACCTCAATAAGACCGGAAGCATCATAACCGCTGAGGACCTTAAGCTATATCAGCCAAAGATAACAGAACCGTTGGTTGTGCCGCTCGGTAATGGCGACATACTATACACCCCTCCACCACCGAGCAGTGGCGCCATCTTGGCTAACATTCTGAACATACTAAGCGGATACAACTTCACGCCCGAGAACATACAGGGAACGGAAAACAAGATACTCACGTATCACAGGATCATCGAGGCTTTTAAGTTCGCGTACGCCGCTAGAACCAGGCTGGGGGACATGGATTTTTTGGATTTAGAAGGGTTCATCAGTAACCTAACATCACCGGAGTACGGGGTGGAATTGATGAAGAGGATCGATGACCTGAAGACCAGTAACGACAGCAGTCACTACGGAGCGACAACATACACCAAACCAGATCACGGGACAGCACACATCTCTGTTATATCTGAAGATGGAGACGCCGTGTCCGTCACAAGCTCCATCAATTTCTACTTCGGTTCAGGTGTCACGGCAACCAACACCGGCATTTTGCTGAACAATGTTATGGACGATTTCTCGTCACCCGGCTTCACCAACTACTTCAACCTTGAACCCTCGCCAGCCAACTTTATTGAACCTCACAAACGCCCCATGTCCACCATGTGTCCTAGCATCATCATCGATAGAAATGGAAATGCCAAAATGGTGATCGGTGCCGCCGGTGGCACAAAGATAGCTTCAGCTGTGGCTTTGGTAACGATGCGGAAACTTTGGTTCGGACAAACAATAAAAGAATCCGTGGACGAAGCACGAATTCATCATCAAATATTTCCAATGCATGTTGAATACGAATACGGAATCATTCAGGACATCATCAAGGGGTTGCGTGCTAAAGGGCACGGTATGGTGCGTTACCGCGGCCGAGGATCTGTGGTGTGCGCCCTCTATCGCAACAAGACCGGCATCTACGCCAACGCGGACTTTAGGAAGGGCGGCGATGTCTCTGGCATTGATTAA

Protein sequence:

>DPOGS207739-PA
MGSIRSSLHRRNFINNQMDDMPMDTIGLVSPINAFLHGNFSNDDSDYPEKPLRSNTKLIIFSLVMLAFVSALSGYLIGQADYNPSKFTEPEDPEQPLAPSASWLHVFQKAAVCTDAPHCSGIGRAILSINGSAVDAAIAAMFCNGLLNQQSMGIGGGFFMTVYIKEEEKAYSVIAREKAPAAAKRDMFGGSSYEASKGSLSIGVPGEVRGMWEAHKRWGRLPWEKLIKPTLEFCKYGFTISKAMYDGIMSAKYIKNDPTLRRMYWDSSKNAYYRPGTLVTPSPALCRTLTRIAKKGGDEMYNGSLAADLADDLNKTGSIITAEDLKLYQPKITEPLVVPLGNGDILYTPPPPSSGAILANILNILSGYNFTPENIQGTENKILTYHRIIEAFKFAYAARTRLGDMDFLDLEGFISNLTSPEYGVELMKRIDDLKTSNDSSHYGATTYTKPDHGTAHISVISEDGDAVSVTSSINFYFGSGVTATNTGILLNNVMDDFSSPGFTNYFNLEPSPANFIEPHKRPMSTMCPSIIIDRNGNAKMVIGAAGGTKIASAVALVTMRKLWFGQTIKESVDEARIHHQIFPMHVEYEYGIIQDIIKGLRAKGHGMVRYRGRGSVVCALYRNKTGIYANADFRKGGDVSGID-