Monarch geneset OGS2.0

DPOGS202935
TranscriptDPOGS202935-TA1113 bp
ProteinDPOGS202935-PA370 aa
Genomic positionDPSCF300220 + 47449-49109
RNAseq coverage101x (Rank: top 61%)
Annotation
HeliconiusHMEL0056318e-14894.96% 
BombyxBGIBMGA001905-TA0.081.62% 
DrosophilaPal2-PD5e-9755.06% 
EBI UniRef50UniRef50_Q9W1L57e-9555.06%Peptidyl-alpha-hydroxyglycine alpha-amidating lyase 2 n=18 Tax=Endopterygota RepID=PAL2_DROME
NCBI RefSeqXP_969342.26e-10356.27%PREDICTED: similar to peptidyl-glycine alpha-amidating monooxygenase [Tribolium castaneum]
NCBI nr blastpgi|1892388281e-10156.27%PREDICTED: similar to peptidyl-glycine alpha-amidating monooxygenase [Tribolium castaneum]
NCBI nr blastxgi|1892388281e-9856.27%PREDICTED: similar to peptidyl-glycine alpha-amidating monooxygenase [Tribolium castaneum]
Group
Gene OntologyGO:00160207.6e-13membrane
GO:00045047.6e-13peptidylglycine monooxygenase activity
GO:00065187.6e-13peptide metabolic process
GO:00055077.6e-13copper ion binding
GO:00551147.6e-13oxidation-reduction process
GO:00055156.2e-08protein binding
KEGG pathway 
InterPro domain[112-365] IPR0110422e-35Six-bladed beta-propeller, TolB-like
[78-96] IPR0007207.6e-13Peptidyl-glycine alpha-amidating monooxygenase
[133-160] IPR0012586.2e-08NHL repeat
Orthology groupMCL15470 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202935-TA
ATGTGGCCCCTAATTTTGTTGTTTCTCACTTTTCACGGAATTAAATGTGAACCGGAAACCGTTAGAGACAATGCCGATTACTTCAGTTACGGTAATGAAGAAAATGTTTTAAAAAATCTGGACCTACACGCGCCAAAAGATGAAATTGTGTTAAGACCGCAAGAAGTTCAAGACTGGCCACAACAGTCTCTGAATGTGGGACAAATAACTGCTGTCTCAATAAATTCTTTGGGACAGCCCGTAATCTTCCACCGAGCCGAAAGAGTGTGGGATGAAAGTACTTTCAATGAATCGAATGTGTATCAGAATCTCGATAAAGGTCCCATTATTGAAGATACCATATTGGTACTCGACCCTCATACAGGTTCTGTGCTTCATAGTTGGGGGGCCTATGCCTTTTATATGCCCCATGGTTTAACTGTAGACCATCATGACAACGTGTGGGTAACTGACGTGGCTAAACATCAAGTTTTCAAGTATACACCGAACAATCACAAATATCCAAGCCTTACCATCGGAGAGGCCTTTACTGCTGGTTACCCTTATAGACGTAGGGTACTGTTATGTATGCCGACGTCAGTAGCTGTCGCTACAACGGGTGAAATTTTTGTTGCCGATGGGTATTGCAACAATCAGATTTTAAAATTCAATGCCGCTGGAACTTTATTATTCGCCATACCCACATTCTCCGATACCCTGACCTTAAATCTGCCACACAGTGTCACCTTGTTGGAAAGTTTGGATGTAGTTTGCGTGGCTGACAGAGAGAATATGAGAATTGTATGCCCCAAAGCTGGGTTGAAGAGCTATGTGAATATGTTTGAAGCGGCGACTGTAATTGAAGATCCCACTCTAGGTCGTGTTTTTGCCGTGGCTTCCCATAATGATATGATTTATGCTGTTAATGGTCCGACCTCTCAAAACATCGCTGTACGGGGTTTTACTGTAAATGCCGTATATGGAAATATATTGGACACTTGGGAACCAAGCGCTGGTTTTACTAATCCTCATTCTCTGGCGGTTACAAGAAACGGCTCCCATCTTTACGTTACGGAAATTGGACCTAATAAAATCTGGAAATTCGAATTAACTGATGTCTTTGACAAGAAATAA

Protein sequence:

>DPOGS202935-PA
MWPLILLFLTFHGIKCEPETVRDNADYFSYGNEENVLKNLDLHAPKDEIVLRPQEVQDWPQQSLNVGQITAVSINSLGQPVIFHRAERVWDESTFNESNVYQNLDKGPIIEDTILVLDPHTGSVLHSWGAYAFYMPHGLTVDHHDNVWVTDVAKHQVFKYTPNNHKYPSLTIGEAFTAGYPYRRRVLLCMPTSVAVATTGEIFVADGYCNNQILKFNAAGTLLFAIPTFSDTLTLNLPHSVTLLESLDVVCVADRENMRIVCPKAGLKSYVNMFEAATVIEDPTLGRVFAVASHNDMIYAVNGPTSQNIAVRGFTVNAVYGNILDTWEPSAGFTNPHSLAVTRNGSHLYVTEIGPNKIWKFELTDVFDKK-