Monarch geneset OGS2.0

DPOGS201225
TranscriptDPOGS201225-TA1560 bp
ProteinDPOGS201225-PA519 aa
Genomic positionDPSCF300037 - 556930-561612
RNAseq coverage467x (Rank: top 27%)
Annotation
HeliconiusHMEL0089474e-8033.52% 
BombyxBGIBMGA008002-TA2e-17661.76% 
DrosophilaCG7910-PA2e-10740.27% 
EBI UniRef50UniRef50_D6W8X81e-11545.71%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W8X8_TRICA
NCBI RefSeqXP_967870.12e-11645.71%PREDICTED: similar to amidase isoform 1 [Tribolium castaneum]
NCBI nr blastpgi|910768244e-11545.71%PREDICTED: similar to amidase isoform 1 [Tribolium castaneum]
NCBI nr blastxgi|910768245e-11846.90%PREDICTED: similar to amidase isoform 1 [Tribolium castaneum]
Group
Gene OntologyGO:00168842.6e-177carbon-nitrogen ligase activity, with glutamine as amido-N-donor
KEGG pathwaydpo:Dpse_GA206782e-104 
 K01426 (E3.5.1.4, amiE)maps-> Styrene degradation
    Benzoate degradation via CoA ligation
    Arginine and proline metabolism
    Tryptophan metabolism
    Phenylalanine metabolism
    Cyanoamino acid metabolism
InterPro domain[24-519] IPR0001202.6e-177Amidase
[28-518] IPR0236311.4e-102Amidase signature domain
Orthology groupMCL15412 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201225-TA
ATGGAGGTAGGAGTTTGGATAGTTGGCGTGTTATTAAGGGTCCTGTGCTTCATCACGGCTCCGTTTTTCTGGCTGCGAACCCGCAAGGAGCAGAGAGTGCCTCCGATCAAGGACCCACTGCTTATGAAAAGCGCGACAAAGTTGGCGGCTGAAATACGTAACGGGGAGTTGACTAGTGAAAATCTAGTGTCGAGATACGTATTGAGGATACAAGAAGTCAATCCGTACATCAATGCCGTGGTCGAAGATCGTTTCCAAGCCGCTATGGAGGAGGCGAGGGATGTTGATAGGAAGATATCCGAGGCTCGAGGAAGAGGGGACCTGGACAAGTTGGTAGCTGACAAACCATTGCTCGGTGTACCTTTCACTGTTAAGGAAAGTTGCTCACTCGCCGGTATGTCCAACTCCGTGGGTTGCTTGGAGTTTTTGGGTCGTCGGGCGTTAACAGACGGCGGGGGTGTGAGTCGCGTGCGGGCCGCGGGTGGAATCCCCCTGCTGGTGTCAGCGACCCCTGAACTGTGTCTGGGCTGGGAGACGACCAGCTTACTGCGAGGGCACACCAACAACCCCTATGGCCTCGCGAGGACGCCGGGAGGATCTTCAGGAGGGGAGGCGGCGTTAGTATCGTCGGGAGCGTCTGTCATATCAGTGTCGTCGGACATCGCCGGCTCCATCAGGATACCCGCAGCTTTCTGCGGTCTCTATGGACACAAACCCACGCCAGGTATAATTCCAATCTCCGGTCACATTCCGACTCTCCAGGACGAGCAATACGCTCGTTTCCTGACCGTGGGTCCCATCACTCGTTACTCCGAGGACCTGCCGCTGATGATGAAGGTGTTGGCGGGGGACAGGGCGCACGAATTGGATCTCGACACGCCAGTCGCCTTACACGAGTTAAAGGTGTACTTCATGACGGAGGCGTCTCGCTCCGTGGCGTTCTCCCCGGTGGAGCTGAGCATTCAGCGAGCGATCTTGGCGGCCGTGCAGCACCTGAAGAGCCGCGGCGCCACCGTCTGTGAGGACAAGTTCAACGACTTCGAGGACGCGGTCGAGATGTCGGCGTCGGTGTTCTTCTCGATGAAAGACATTCCCAACATGTTGCAGGACCCGGCCAACCCTAAGCGCGAGAAGAACCTGATACTTGAAACTTTGAAGACGTTACTCGGCTCGGGGTCGAGGACTTTGCAGGCGCTCGGCTTCGAGGTTCTGAAGAGGAAGAGGCTGTTCGTACCCAAAGAGAAGGTCCCCCACTACATAGAGAGGACTGACAGACTGAGAGAAACGATGGAGCGCGCCCTGGGCTGTTCCGGCGTGTTCCTGTTCCCGAGTCACTCGTGTTCGTGTCACGCCCACGGCGGCGTGTTCGTAAAGGCGGCGGGCGTTGTGTACACGATGCCGTTCAACGCGCTGGGTCTCCCGGCTACGTCGGTCCCGATCCCGGGCCCCGGGCCTCGGCCCGTCGCCGTGCAGGTGGTGGCGGGCCCAGGACAGGATCGGCTCTGCCTGGCGGTCGCCCGGGAGTTGGAGAACAAGTTCGGTGGCTGGACTCCCCCTTAA

Protein sequence:

>DPOGS201225-PA
MEVGVWIVGVLLRVLCFITAPFFWLRTRKEQRVPPIKDPLLMKSATKLAAEIRNGELTSENLVSRYVLRIQEVNPYINAVVEDRFQAAMEEARDVDRKISEARGRGDLDKLVADKPLLGVPFTVKESCSLAGMSNSVGCLEFLGRRALTDGGGVSRVRAAGGIPLLVSATPELCLGWETTSLLRGHTNNPYGLARTPGGSSGGEAALVSSGASVISVSSDIAGSIRIPAAFCGLYGHKPTPGIIPISGHIPTLQDEQYARFLTVGPITRYSEDLPLMMKVLAGDRAHELDLDTPVALHELKVYFMTEASRSVAFSPVELSIQRAILAAVQHLKSRGATVCEDKFNDFEDAVEMSASVFFSMKDIPNMLQDPANPKREKNLILETLKTLLGSGSRTLQALGFEVLKRKRLFVPKEKVPHYIERTDRLRETMERALGCSGVFLFPSHSCSCHAHGGVFVKAAGVVYTMPFNALGLPATSVPIPGPGPRPVAVQVVAGPGQDRLCLAVARELENKFGGWTPP-