Monarch geneset OGS2.0

DPOGS201226
TranscriptDPOGS201226-TA1584 bp
ProteinDPOGS201226-PA527 aa
Genomic positionDPSCF300037 - 346497-349617
RNAseq coverage798x (Rank: top 16%)
Annotation
HeliconiusHMEL0053520.068.83% 
BombyxBGIBMGA012491-TA0.070.91% 
DrosophilaCG5191-PB4e-13244.34% 
EBI UniRef50UniRef50_Q9I7I66e-13044.34%CG5191, isoform B n=19 Tax=Diptera RepID=Q9I7I6_DROME
NCBI RefSeqNP_650893.21e-13044.34%CG5191, isoform B [Drosophila melanogaster]
NCBI nr blastpgi|455507742e-12944.34%CG5191, isoform B [Drosophila melanogaster]
NCBI nr blastxgi|1954982191e-12544.34%GE25669 [Drosophila yakuba]
Group
Gene OntologyGO:00168842.6e-208carbon-nitrogen ligase activity, with glutamine as amido-N-donor
KEGG pathwaydpo:Dpse_GA206788e-87 
 K01426 (E3.5.1.4, amiE)maps-> Styrene degradation
    Benzoate degradation via CoA ligation
    Arginine and proline metabolism
    Tryptophan metabolism
    Phenylalanine metabolism
    Cyanoamino acid metabolism
InterPro domain[1-523] IPR0001202.6e-208Amidase
[25-516] IPR0236313e-113Amidase signature domain
Orthology groupMCL17419 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201226-TA
ATGAGGAGATGTCTCCGCTTCCTGGTGTCCCTCCTGGCCATCTTCGTCCTTCCAACCACATATATAGTTAACATCAAGCGAAACAGAAAATGTCCTCCACCAACCAACCCCATACTGTACAAATCCGCCACTACCCTGGCCATGATGATCAGAACTAAACAAATCACATCAGAAGAGGTGGTGAAGTCCTATATTGAAAGATGTAAAGAAGTTAACCCATACTTGAATGCTATCGTTGAACCACGGTACGACCTAGCTTTGAAAGAGGCGAAATGCATCGACAAAATGATAGCATCCAACGACCGCACCCCAGAAGACCTGGCCAAAGAACATCCTCTTTTGGGAGTTCCTTTGACGGTCAAGGAGAGCATTGCTGTTGAAGGCATGAGCAACGACTGCGGCACGATACACCACAAGCGTCAACCTGCTACGCGAGACGCTGACGTGGTCCGCGCAGTACGAGCTGCTGGCGCTGTGATAATAGCTGTCACTAACACGCCCCAGCTGTGCATGAACTGGGAGACATACAATAACGTCACCGGACTCACAATGAACCCGTACGACCAGAGGCTAACCACTGGCGGATCCTCTGGTGGAGAGTCAGCACTAATTTCGTCAGCAGCTTCAGTAATTGGAATGGGTTCTGATATCGCAGGTTCGCTTCGGCTTCCACCTATGTTCAATGGAATTTTTGGACATAAACCTACTCCAAAACTTATATCTATCCAGGGTCACGTTCCAGATTGTTTAGAATCTGAATTCGAGGAATACTTCGCTCTCGGACCCATCACGAGATACGCTGAAGATCTGTCTCTAATGTTAAAAGTCTTAAGACAACCTAACGGTCCTGATGTACCATTGGACAAACCTGTGGATCTCACACGTTTAAGGTTCTATTACATGGAAGGTGACTGCAGCAATGTTACTGATAACATTGGTTCGGACATGAAAAAAGCCCTTTACAAAGCCAAAGACTACATTAAGTCTACTTATAACGTAGAAGTTGAAGAGCTCAAAATACCAAACATAGAACATATGTGGGAGATAAGTGTGAGGGTTTTACTGAAGGTGAATCACGTACAGAACATCTATACAGACCCGGAAAAAAGAGACCAGTGGGTATCAGTGTGGCCCGAGGTGTTGAAGAAGATGGTAGGCATGTCGGATCATTCGTTCACGTCAGTTTTCTATGGACCAGTCAAGAAGTTCTTTGATGCTTTACCAAACAGCTATTATGAGCAGCTGCTGAAGGTTTTCGAACAAGTTAAGACTGATTTCAGCGAAGCTCTATCTGACGACGCTGTGTTACTGTTTCCGACATATCCCTACCCGGCTCACAAACATTACAGAATATTCTACAGATTCTTAAACTGCGGCTACCTAACAATATTCAATGTTTTAGGACTACCCGCTACTGCTTGTCCTTTAGGTCTATCAGATAAGGGTCTTCCTGTAGGAATACAAGTTGTTGCTAACAAATGCAATGATCATCTAACATTAGCAGTGGCGAAAGAATTCGAAAAGGCTTTCGGCGGCTGGTCGCCACCCAATAAGGATCTTTTGAACAGTGTAAAAACTGCGTAA

Protein sequence:

>DPOGS201226-PA
MRRCLRFLVSLLAIFVLPTTYIVNIKRNRKCPPPTNPILYKSATTLAMMIRTKQITSEEVVKSYIERCKEVNPYLNAIVEPRYDLALKEAKCIDKMIASNDRTPEDLAKEHPLLGVPLTVKESIAVEGMSNDCGTIHHKRQPATRDADVVRAVRAAGAVIIAVTNTPQLCMNWETYNNVTGLTMNPYDQRLTTGGSSGGESALISSAASVIGMGSDIAGSLRLPPMFNGIFGHKPTPKLISIQGHVPDCLESEFEEYFALGPITRYAEDLSLMLKVLRQPNGPDVPLDKPVDLTRLRFYYMEGDCSNVTDNIGSDMKKALYKAKDYIKSTYNVEVEELKIPNIEHMWEISVRVLLKVNHVQNIYTDPEKRDQWVSVWPEVLKKMVGMSDHSFTSVFYGPVKKFFDALPNSYYEQLLKVFEQVKTDFSEALSDDAVLLFPTYPYPAHKHYRIFYRFLNCGYLTIFNVLGLPATACPLGLSDKGLPVGIQVVANKCNDHLTLAVAKEFEKAFGGWSPPNKDLLNSVKTA-