Monarch geneset OGS2.0

DPOGS211049
TranscriptDPOGS211049-TA1464 bp
ProteinDPOGS211049-PA487 aa
Genomic positionDPSCF300202 + 257014-262581
RNAseq coverage563x (Rank: top 22%)
Annotation
HeliconiusHMEL0043302e-8949.38% 
BombyxBGIBMGA003799-TA3e-0966.13% 
DrosophilaCG6461-PA7e-1436.62% 
EBI UniRef50UniRef50_D6X3X02e-5432.00%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6X3X0_TRICA
NCBI RefSeqXP_967797.23e-5231.07%PREDICTED: similar to gamma-glutamyltransferase-like 3 [Tribolium castaneum]
NCBI nr blastpgi|2700010327e-5432.00%hypothetical protein TcasGA2_TC011313 [Tribolium castaneum]
NCBI nr blastxgi|2700010323e-4931.59%hypothetical protein TcasGA2_TC011313 [Tribolium castaneum]
Group
Gene OntologyGO:00038409.9e-20gamma-glutamyltransferase activity
KEGG pathwaybta:6139194e-15 
 K00681 (ggt)maps-> Arachidonic acid metabolism
    Glutathione metabolism
    Taurine and hypotaurine metabolism
    Cyanoamino acid metabolism
    Selenoamino acid metabolism
InterPro domain[38-197] IPR0001019.9e-20Gamma-glutamyltranspeptidase
Orthology groupMCL34866 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211049-TA
ATGGCCGGCTGTGAGACACACAATGCTACAGGTGTGGTGGAGTTGCGAGAGGACGCTCCGCTGACAGCGAGTGCCACAAGCGCTCGCCTGTGCGCGGACGGACCGCGTGCTATAGTTGGAGCGTTCGCCGCTCTCACAGCCGCTGTCACTGTGGCCTTACTGACACAAATATATTATGGCGATTATGAGGTGGTGCCGCACGGTTCAGTATCCTCGTCCGCTGCATCCTGCTCGCGGGCGGGCGCGGCCACACTGCTCGCGGGAGGGCGGGCGGTCGACGCCGCTATTGCCGCCGCACTCTGCCTCGCTGTCCTGGCCCCACACCGCACATCACTCGATGCGAGCGGTTCTCTGGTGTACTGGGAGTACCGGTCATCACGGGGTCAAGGTGCTACGGTCGTGGAGTGGGGAGGGCCGGAGGAGGAGCAAGCTGACCATGAGACGAATGTGACAGACCGACCGCCCCGCCTATTAATGGCGCTGTCCGCGCTACATGACCGGCTCGGCTCCAAGCCCTGGACGGAGCTCGTGCAGCCTGCTATAGACCTCGCCAGGGAAGGTTACGTGGTTTCAGAAAGCATGTCTGCAGCGGCGTCTGCTCGTGACTTGCTGGGCTTCACGACCGGCGCCACACGGACGGAGTCCGCGCTGGCAGATTACCTCCACACCTTAGTACATAACACTAGTAAAGAGCTGTGTTCTCTATGGTCGTGTTCGTCGAGGGTGCGATGGCGGCCAGGTTCATTTGTGTCAGCCGGCTCGTGGCGCGTGTGGTCGGCGGGCCCGGGCGGGGAGGAGGCAGCCGCCGCGCTCCGTCAAGCGCTGGAACCGACACCAGCTGATACGGGGAACGCCTTCCGACGTGTAGTACAAAGTCTCCTGGAACAGAAACAGTCTAAGTCTGCGGCTAGTCAGCCTACACCGGGTGGAGTCGCTTCGGGACTTGCAATCGTCGATCCCCTGGACACTTATCTAGCACTTGTAACCGGTCTGTCTGTTCCTTTCGGATCGGGTCCAAGTGTGGGCGGCGCCTGGACTAAGGACGAACCCACCGCGCCACTCGATCTGTCCCCTGCTATTATAACCGATGATCACGTCTGTGGTACTCGATACATAATCGGAGCTGAGTCCAGTGCAGCTCTGGCCGAAGGTGCGGTAGGAGCTCTGGTGGAGGGCGCGGCGGCTGCCGTAGACTCGGCGGAGAGTGCGCAGGGCGCGGCGGCTGCCGTAGACTCGGTGGAGAGTGCGCGCGCTGTCCTGCTGCCGACCGGTGAAGTGGTGCTGGAGCCCGGACGAGGCGTTCCACTCGCACCTCCCGGAGCCACTGCAGTCCCCGGCCCCTCCTATCCGATAGCCAGCGAGCTTCCTCTGCCGCAGCCTGCTCTTAATTTTGTCCAACAGCGCGGAGACGCTCTCCTCTCGCACGCCGACAGTCGGGGCGGCGGACTCGCCTCACGCTTCTGA

Protein sequence:

>DPOGS211049-PA
MAGCETHNATGVVELREDAPLTASATSARLCADGPRAIVGAFAALTAAVTVALLTQIYYGDYEVVPHGSVSSSAASCSRAGAATLLAGGRAVDAAIAAALCLAVLAPHRTSLDASGSLVYWEYRSSRGQGATVVEWGGPEEEQADHETNVTDRPPRLLMALSALHDRLGSKPWTELVQPAIDLAREGYVVSESMSAAASARDLLGFTTGATRTESALADYLHTLVHNTSKELCSLWSCSSRVRWRPGSFVSAGSWRVWSAGPGGEEAAAALRQALEPTPADTGNAFRRVVQSLLEQKQSKSAASQPTPGGVASGLAIVDPLDTYLALVTGLSVPFGSGPSVGGAWTKDEPTAPLDLSPAIITDDHVCGTRYIIGAESSAALAEGAVGALVEGAAAAVDSAESAQGAAAAVDSVESARAVLLPTGEVVLEPGRGVPLAPPGATAVPGPSYPIASELPLPQPALNFVQQRGDALLSHADSRGGGLASRF-