Monarch geneset OGS2.0

DPOGS215724
TranscriptDPOGS215724-TA1398 bp
ProteinDPOGS215724-PA465 aa
Genomic positionDPSCF300041 + 292072-297238
RNAseq coverage190x (Rank: top 48%)
Annotation
HeliconiusHMEL0056314e-5141.02% 
BombyxBGIBMGA005815-TA1e-13358.49% 
DrosophilaPal1-PB2e-8043.89% 
EBI UniRef50UniRef50_B4KNT55e-8339.22%GI20251 n=6 Tax=Drosophila RepID=B4KNT5_DROMO
NCBI RefSeqXP_001960843.18e-8440.00%GF13565 [Drosophila ananassae]
NCBI nr blastpgi|1947571811e-8240.00%GF13565 [Drosophila ananassae]
NCBI nr blastxgi|1582853572e-8047.59%AGAP007606-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00160202e-19membrane
GO:00045042e-19peptidylglycine monooxygenase activity
GO:00065182e-19peptide metabolic process
GO:00055072e-19copper ion binding
GO:00551142e-19oxidation-reduction process
GO:00055153.6e-08protein binding
KEGG pathway 
InterPro domain[113-376] IPR0110421.4e-42Six-bladed beta-propeller, TolB-like
[79-97] IPR0007202e-19Peptidyl-glycine alpha-amidating monooxygenase
[329-356] IPR0012583.6e-08NHL repeat
Orthology groupMCL16458 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215724-TA
ATGATACATTTTTATAGAAGTACACTTAAGGCCTTATTATTACTATTTGTGTCGTGTATTGCAGCAAGAAGGTTGCCGGAAGAATACTATCCGGATAATTTTTTCGCCAACCAACAGTCTTTAAAAATAAATGCGGCACTACAAAAATTAGAGCACGTTCCACAATGGGTACCCAACTGGCCGGACTCCAAAATCAAGATGGGTCAAGTATCAGGGGTAGCACTTGATAATTCAGGGCAATTGCTGGTATTTCACCGAGCTGATAATACTTGGGATGCGAATACATTTTCCATCAGGAATGTATATCAAGCCATTGGAGAGCCGCCCATCTCACAACCTACAATACTAGTTTTTAATGAAACTGGAGTTATGGTCGACTCTTGGGGACAGAATCTTTTCCACATGCCACATGGAATAACAGTCGACAGCGAGAGCAACGTCTGGGTGACGGATGTAGCTCTCCATCAGGTGTTCAAGTTCACACCAGACAACAGAACCGCGCCAGCTTTAGTGCTCGGAGAGAAGTTTGGGCCGCTACTGGACAAACATTTCTGCAAGCCGAGCGCGGTGGCCGTGCTCAGCTCCGGAGACTTCTTTGTGGCCGACGGCTACTGTAACACTCGCATCGTCAAGTACGCCGCAGACGGCACCAAGATACTGCAGTGGGGGAAACGTCTGGGCGAGTCTCCGTTCGTGTTGTCAGTCCCGCACGCGCTGTCCGTGTCGGAGGACCGCTCGCTGCTGGTGGTGGCGGACCGCGAGCGGGGCCGCGTCGCCTGCTTCAGGACGGACTCGGGCGCCTTCGTCACAGCCTTCAGACACTGGCTCATAGGACCGAGACTGTTCAGTGTAGCGTACTCGCCGATACACGGAGGTCGTCTGTATATAGTAAACGGACCAACAATCGGCCCTCCGCCGGTTAGGGGTTACGTCATAGACTTCTCGTCAGGGAGGTTGATCCAGACCTTCGCTACAGGCGACAGCTTCAGTAATCCTCACGATCTGGTGGTGTCTCCTGATGGGACGGTCTACGTCGCGGAACTGGATCCCCATAGGGTCCACAAATTCGTCGACGACACTCTAAGAAACGAGACCAAAGTTAACGTCACTAGGACGAAACCAACTACCGTCGAGGTCGGTGTAGCGGGAGGGTGGGAATGGGAGCGTTGGGGCTCGTGGGCCGGCGCCGCTGGCAGCGCGCTAGGGGCCGCGTGCGGAGCCCTGCTCTTAGCACTGTGCCGCGCCCGCGACGGTCGCAAATCGGTCGGCCGACGTCGTTGGGAATACGATCACAGTCAGTTCAAGTTACGCCGGTTGCTGGAAAGGCGGCGCTTTACACGAGTGCACTCAGATGATTCAGAAGACGAGCCCGCACCAATGTTGCCACCAACAGTATAA

Protein sequence:

>DPOGS215724-PA
MIHFYRSTLKALLLLFVSCIAARRLPEEYYPDNFFANQQSLKINAALQKLEHVPQWVPNWPDSKIKMGQVSGVALDNSGQLLVFHRADNTWDANTFSIRNVYQAIGEPPISQPTILVFNETGVMVDSWGQNLFHMPHGITVDSESNVWVTDVALHQVFKFTPDNRTAPALVLGEKFGPLLDKHFCKPSAVAVLSSGDFFVADGYCNTRIVKYAADGTKILQWGKRLGESPFVLSVPHALSVSEDRSLLVVADRERGRVACFRTDSGAFVTAFRHWLIGPRLFSVAYSPIHGGRLYIVNGPTIGPPPVRGYVIDFSSGRLIQTFATGDSFSNPHDLVVSPDGTVYVAELDPHRVHKFVDDTLRNETKVNVTRTKPTTVEVGVAGGWEWERWGSWAGAAGSALGAACGALLLALCRARDGRKSVGRRRWEYDHSQFKLRRLLERRRFTRVHSDDSEDEPAPMLPPTV-