Monarch geneset OGS2.0

DPOGS208828
TranscriptDPOGS208828-TA1539 bp
ProteinDPOGS208828-PA512 aa
Genomic positionDPSCF300036 + 676189-682951
RNAseq coverage147x (Rank: top 54%)
Annotation
HeliconiusHMEL0041912e-12365.14% 
BombyxBGIBMGA007934-TA4e-13166.46% 
DrosophilaCG9581-PA3e-11145.34% 
EBI UniRef50UniRef50_E2BM742e-12645.65%Probable Xaa-Pro aminopeptidase 3 n=2 Tax=Formicidae RepID=E2BM74_HARSA
NCBI RefSeqXP_001605691.13e-13148.28%PREDICTED: similar to xaa-pro dipeptidase app(e.coli) [Nasonia vitripennis]
NCBI nr blastpgi|3454937902e-13045.14%PREDICTED: probable Xaa-Pro aminopeptidase 3-like [Nasonia vitripennis]
NCBI nr blastxgi|3454937902e-12645.14%PREDICTED: probable Xaa-Pro aminopeptidase 3-like [Nasonia vitripennis]
Group
Gene OntologyGO:00099871.8e-58cellular process
GO:00041772.3e-32aminopeptidase activity
GO:00301452.3e-32manganese ion binding
KEGG pathway 
InterPro domain[250-512] IPR0009941.8e-58Peptidase M24, structural domain
[74-205] IPR0078652.3e-32Peptidase M24B, X-Pro dipeptidase/aminopeptidase P N-terminal
Orthology groupMCL12142 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208828-TA
ATGCAGAGTTTTCGTCGTTTATCATTACAATTAGCGAGTAAAAGTCCTAGAAGACTTCTAAGGAACACGCTAATTCGAGAAAGATGTAGATATAAATCGAGTATAGGCAATCCCCCTGATGTTATGTCAGAAACAACAACTATTCCCAAAGGAATACTTGGCCAACCTACTTGCCACACTCACCCTCACTTGATATCCGATGGCCACCTGACTTGCGGTATCACACAACAAGAGTATAAGGATAGAAGAGATACTTTAGTAGAAAAGCTAGTGGCAAGCAAAGAAAATGAACATAGATCTCATATTATAGTAATTCCAGCAGCACGTAAGCAGTACATGTCGGAGAAGATACCATATGTATTTAGACAGAATTCTGACTTCTTTTATCTGACAGGATGTCTGGAACCTTCTGCCATCCTAGTCATGGTGAAGCAATCACACGAAGATAGTTATAAGAGTATTCTATTTGTAAACGATAAGGACAGCCATGCTGAGTTATGGGAAGGACCGCGCACCGGATGTGCGTTGGCCGCTCCACTCTTCACAGTTGAAGAATCACGGCCCGTAGAGAATTTTAATAACTTTATACACAGAATAGTATCGACATCGAAACCAGCCATACTATGGTATCAGAATGAGTGTCCACCGAACCCTGACATCCACGAGTATGTCCGTTCCTCACTGCGTCAGGGTCATGTAACGCTGGACGAACCCCAAAAAGTACTTCATCAGATGAGGGTTATCAAATCGCCGGCTGAGATTGAGTTGATGAGAGACACTTGTCATATCGGCTCGCAGTCCATAAACCTGGCAATGGCCTGCACAAAACCTGGTATGTCAGAACATAACGTGGCTGCTATATTGGAGTACTCCTGGCGGACGGGGGGCGCGGAACACGGGGCCTTCCCCCCGGTACTGGCGGGGGGAGCGCGAGCCACTCACATACACTATGTGGCCAACAACCAACTCCTCAGACATGGAGAGATGATACTCGTGGACGCTGGGACACAAAGATGGCTGTACAATTCTGATATATCCCGCACGTGGCCCGTGTCCGGGAAGTTCTCTAAGCACCAGAGAATACTCTACGAACTGATACTTTCGGTGCAGAAACGTCTGATCGACCTGCTGGGTCAGCATCGGCCGTCTCTGGACACGTTGTTCGAGCACATGTGTCGCCTACTGGGAAGCCAGCTGCAGCAGGAGGGGATCATACCGAAGAATATTGACAACAACGAGCTTATCGGGCGAGCGTACCGCCTGTGTCCTCACCACGTGTCCCACTACCTCGGCCTGGACGTGCACGACGCGCCGCTGGTCCGGCGCCGTGTGCCCGTCACCAGCGGGATGGTGGTCACCGTCGAGCCAGGTATCTACATAGCTCCAGATGATAGATCCGTTCCAGAAGAATTCCGTGGAGTTGGCATCCGCGTCGAGGACGACGTGTTGTTGACTGACGGGGACCCCGAGGTGCTGACGCGGACCTGCCTTAAGGAGGTGGACGACATAGAGGCTGTGGTCGGCAAGCAGAGCTCGTGA

Protein sequence:

>DPOGS208828-PA
MQSFRRLSLQLASKSPRRLLRNTLIRERCRYKSSIGNPPDVMSETTTIPKGILGQPTCHTHPHLISDGHLTCGITQQEYKDRRDTLVEKLVASKENEHRSHIIVIPAARKQYMSEKIPYVFRQNSDFFYLTGCLEPSAILVMVKQSHEDSYKSILFVNDKDSHAELWEGPRTGCALAAPLFTVEESRPVENFNNFIHRIVSTSKPAILWYQNECPPNPDIHEYVRSSLRQGHVTLDEPQKVLHQMRVIKSPAEIELMRDTCHIGSQSINLAMACTKPGMSEHNVAAILEYSWRTGGAEHGAFPPVLAGGARATHIHYVANNQLLRHGEMILVDAGTQRWLYNSDISRTWPVSGKFSKHQRILYELILSVQKRLIDLLGQHRPSLDTLFEHMCRLLGSQLQQEGIIPKNIDNNELIGRAYRLCPHHVSHYLGLDVHDAPLVRRRVPVTSGMVVTVEPGIYIAPDDRSVPEEFRGVGIRVEDDVLLTDGDPEVLTRTCLKEVDDIEAVVGKQSS-