Monarch geneset OGS2.0

DPOGS209502
Transcript: DPOGS209502-TA (2073 bp)
Protein: DPOGS209502-PA (690 aa)
Genomic position: DPSCF300127: 39425-49568
RNAseq coverage: 1171x (Rank: top 11%)
Annotation
Database        Hit               E-value  Identity  Description
Heliconius      HMEL008947        3e-79    36.03%
Bombyx          BGIBMGA007424-TA  1e-178   59.88%
Drosophila      CG8839-PE         1e-89    41.47%
EBI UniRef50    UniRef50_E0VE66   1e-107   41.03%    Amidotransferase subunit A, putative n=2 Tax=Neoptera RepID=E0VE66_PEDHC
NCBI RefSeq     XP_002424410.1    2e-108   41.03%    amidotransferase subunit A, putative [Pediculus humanus corporis]
NCBI nr blastp  gi|242007160      4e-107   41.03%    amidotransferase subunit A, putative [Pediculus humanus corporis]
NCBI nr blastx  gi|242007160      4e-105   41.11%    amidotransferase subunit A, putative [Pediculus humanus corporis]
Group
Gene Ontology: GO:0016884 (7.3e-151) carbon-nitrogen ligase activity, with glutamine as amido-N-donor
KEGG pathway: dpo:Dpse_GA20678 (7e-51)
    K01426 (E3.5.1.4, amiE) maps to:
        Styrene degradation
        Benzoate degradation via CoA ligation
        Arginine and proline metabolism
        Tryptophan metabolism
        Phenylalanine metabolism
        Cyanoamino acid metabolism
InterPro domain:
    [40-681] IPR000120 (7.3e-151) Amidase
    [39-467] IPR023631 (1.1e-82) Amidase signature domain
Orthology group: MCL30500, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209502-TA
ATGTCATTTTTAAAATCTATTTGTATATACATAAGAATATTAATTGACAAGACGATAGATTTCATCTTTTCGTTGTATTGGGAAGGCAAGAAGCAAGTAATACCGGACCTGGAAAAGAGGCATGCCTTCCTTGCTGAGAGTGCTACGAGCTTGGCGAGGAAGATCAAAAACAAAGAGCTGACGTCCGAGACGTTAGTTCAGGCCATGATCGAGCGAATGAAACAGGTTAACCCGCTGTTAAATGCTATCGTCGCGGATATGTATGAAACAGCCCTAGAAGAAGCGAGAGAGATTGACAGGCAAATAGCTCAAGGGCTGTCAGAGGAGTTGGCCAATAAACCCTTTTTAGGGGTGCCATTTACAACAAAGGAAAGTCAAGGTTTGAAAGGAATGCCAACGACGATGGGTCTGTGGTGCCGTCGGAACGAGAGAGCCAGCGAGGACAGCGAGGCAGTTATTAGATTAAGGAAAGCCGGGGCTGTAGCTCTAGCTACCACAAACTTGCCAGAATTATTGATATGGCAAGAGACTCGCAACCCGGTGTATGGACAGACAAACAACCCTCACCATACAGGGCGCAGTCCCGGGGGCTCTAGCGGGGCAGAAGCAGCGCTCAGCGCCACGTACGCCACTGCTATCAGCTTGTGCTCTGATATTGGCGGCTCGACTCGTATGCCAGCGTTTTTCTGTGGACTGTTTGGACATCACCCTACCGCTGGTACAACTAACACCAAAGGTTCATTTTATCGCACGGGTGAGGAAGACAGTATGTATTGTTTGGGGTTCATATCCAAGCACGTAGAAGACTTGGGGCCTCTCACTAAAATCGTGGCTGGCGATAAAGCTGACCTACTTAAGCTGGATAGAAACGTTGATTGTAAGGACATCAAATTTTATTATATAGAATCGAGCAATGATTGTCACGTGAGTCCGATCCAACCGGAAATTAAAGACGCCATGAATAAGGTGATAAAGAAGCTGCAAGAAGATTTCGGTACCACAGCCGAGCCCTACCACCATCCAGGGTTCGACAGCATGTACTCGCTGTGGGCGCACAGCATGTCCGCCGAGCCCGGGGACTTCACAACCATGCTTGTTAACGGAAAGGATCGTGTCAATGGTTTTAAAGAATTGGGAAAAAAGATGCTAGGGCTGAGTAAGTACTGCCTGTTCACGATAATGAGAGTGCTGGAGATGCAAGTGTTGCCAGCACCGAACAAGGAGTGGGCTGAGAAAACGATAAGCAGTATGAAGGAGGACCTCTTTAGTAAGTTAGGTGGTAGTGGCGTCCTGCTTCTTCCAAGTTCACCGACGGCCGCCCCCTACCACTACTCGCCAGTACTGAGACCCTACAACTTTTCATATTGGGGGCATGTTAATACGCTCAAGTGCCCCGCGACACAGGACATCAAATTTTATTATATAGAATCGAGCAATGATTGTCACGTGAGTCCGATCCAACCGGAAATTAAAGACGCCATGAATAAGGTGATAAAGAAGCTGCAAGAAGATTTCGGTACCACAGCCGAGCCCTACCACCATCCAGGGTTCGACAGCATGTACTCGCTGTGGGCGCACAGCATGTCCGCCGAGCCCGGGGACTTCACAACCATGCTTGTTAACGGAAAGGATCGTGTCAATGGTTTTAAAGAATTGGGAAAGAAGATGCTAGGGCTGAGTAAGTACTGCCTGTTCACGATAATGAGAGTGCTGGAGATGCAAGTGTTGCCAGCACCTAACAAGGAGTGGGCTGAGAAAACGATAAGCAGTATGAAGGAGGACCTCTTTAGTAAGTTAGGTGGTAGTGGCGTCCTCCTTCTTCCAAGTTCACCGACGGCCGCCCCCTACCACTACTCGCCAGTACTGAGACCCTACAACTTTTCATATTGGGGGCATGTTAATACGCTCAAGTGCCCCGCGACACAGGTACCTCTCGGTAGGAACAGTGACGGTCTTCCTATCGGTATCCAAGTCCTGGCTGCTCCTTACAACGACGCTCT
CTGCCTCAGCGTCGCCAAATACTTAGAGAAAGAGTTCGGGGGAGCGATCATGGCGTGTGATATTAAGAAATAA

Protein sequence:

>DPOGS209502-PA
MSFLKSICIYIRILIDKTIDFIFSLYWEGKKQVIPDLEKRHAFLAESATSLARKIKNKELTSETLVQAMIERMKQVNPLLNAIVADMYETALEEAREIDRQIAQGLSEELANKPFLGVPFTTKESQGLKGMPTTMGLWCRRNERASEDSEAVIRLRKAGAVALATTNLPELLIWQETRNPVYGQTNNPHHTGRSPGGSSGAEAALSATYATAISLCSDIGGSTRMPAFFCGLFGHHPTAGTTNTKGSFYRTGEEDSMYCLGFISKHVEDLGPLTKIVAGDKADLLKLDRNVDCKDIKFYYIESSNDCHVSPIQPEIKDAMNKVIKKLQEDFGTTAEPYHHPGFDSMYSLWAHSMSAEPGDFTTMLVNGKDRVNGFKELGKKMLGLSKYCLFTIMRVLEMQVLPAPNKEWAEKTISSMKEDLFSKLGGSGVLLLPSSPTAAPYHYSPVLRPYNFSYWGHVNTLKCPATQDIKFYYIESSNDCHVSPIQPEIKDAMNKVIKKLQEDFGTTAEPYHHPGFDSMYSLWAHSMSAEPGDFTTMLVNGKDRVNGFKELGKKMLGLSKYCLFTIMRVLEMQVLPAPNKEWAEKTISSMKEDLFSKLGGSGVLLLPSSPTAAPYHYSPVLRPYNFSYWGHVNTLKCPATQVPLGRNSDGLPIGIQVLAAPYNDALCLSVAKYLEKEFGGAIMACDIKK-
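
The lengths reported for this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the transcript length counts the terminal stop codon (the nucleotide sequence ends in TAA and the protein ends in "-") and that the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
def expected_cds_length(protein_length: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_length + (1 if include_stop else 0)
    return 3 * codons


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# 690 aa plus one stop codon gives the 2073 bp listed for DPOGS209502-TA.
print(expected_cds_length(690))    # 2073

# Genomic span on DPSCF300127 under the inclusive-coordinate assumption.
print(span_length(39425, 49568))   # 10144
```

Under these assumptions the transcript (2073 bp) covers roughly a fifth of the genomic span, consistent with a gene model that contains introns.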