Monarch geneset OGS2.0

DPOGS208915
TranscriptDPOGS208915-TA2283 bp
ProteinDPOGS208915-PA760 aa
Genomic positionDPSCF300009 - 422990-433692
RNAseq coverage10x (Rank: top 84%)
Annotation
HeliconiusHMEL0115580.073.93% 
BombyxBGIBMGA002488-TA0.068.29% 
DrosophilaCG6225-PA1e-11639.02% 
EBI UniRef50UniRef50_E9H9J22e-11536.85%Putative uncharacterized protein n=3 Tax=Eumetazoa RepID=E9H9J2_DAPPU
NCBI RefSeqXP_001656516.14e-11736.54%xaa-pro aminopeptidase [Aedes aegypti]
NCBI nr blastpgi|3838616201e-11638.14%PREDICTED: xaa-Pro aminopeptidase 1-like [Megachile rotundata]
NCBI nr blastxgi|3838616202e-11438.14%PREDICTED: xaa-Pro aminopeptidase 1-like [Megachile rotundata]
Group
Gene OntologyGO:00099871.7e-37cellular process
GO:00167871.5e-10hydrolase activity
KEGG pathway 
InterPro domain[437-691] IPR0009941.7e-37Peptidase M24, structural domain
[137-263] IPR0005871.5e-10Creatinase
Orthology groupMCL15619 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208915-TA
ATGCGTATCTTGCGTAAGTTGGTCTCTTCCCATTGGTCGTTGAGACCACGCCTCGTGCGTACGAACCTTGAAGTAGCTGCGACCGGCCCTTACGAGGCCCTTTTCGAACTCCTCGATTACATAAAGAAAGAAGAAATGTTACAGAAGTATGAATTTAACAGCTGCGGCTACTCGGCTGACGGAATGCAAGCAAAAGTTAGCAAGAAAGAATGTTTGTCAGCACGTGTGACGCCAACGACGACCATTCCGAGCACTCCATCCTCTCCAACGCCAGCGACCCTACTCATCAGCCAGGAGCTGCCCACGGTGCCGCAGCCCCCTCAGATATGTTCGGAAGCCGCCAACCCCAACACTCCCCTGAGCAGACTGCGGATCGCGATGAAAAATATTACTCAGTTGAATGCATTCTACGACGCGTTCCTAGTGTTCTTCAGCGACGAGCACCTGAGTGAAGAACCTGCTCCGGAGGAACGTCGTCTGGAATACATCAGCGGTTGGGAAGGTTCTGGAACAGCGGCTGTATTGGCTGACGGCGGTGCAGCTATCTGGGTACCCTCGGGAGAAGTCAGACGAGCTCGTGATTCACTATCCTGCGCCTGGCTCGTGCTGGATGAGAACGATCCCAGCCAACCCTCCATCGCCGAATGGATTAGCGAGCATCCAGGGAGGAACGGTCGTGTGGGTGGTGATGCAAGGTTGATCTCAGCGGCTGAATGGCACTTATTAACGGCGAGCCTCTCACAAGAAGGGCTGCAGTTGGTCCACGTACCGACCCTAGTGGACCAACTCTGGGATACTGAAATTGATCCTGCGAAGAGACGACCGGAATTCTCCAGGATTGTCGCCAATTTACACAATATAGATTATACAGGTGTTTCTTGGCGTGAAAAAGTGCAGGCTGTTAGGAACGAATTACGACCTCTGGGTTCAGACGCGATGGTTGTCACTGCCTTAGACGAAGTGGCTTGGCTCCTTAACATACGCGGCAAAGACCTTCCGTATGCTCCTCTACTGAAAGCGTTCGTCGTTATTGGCTCTAAAGATGTCAGAGTTTATGCACCGGCCGGCAAATTATCTTTACCAGTCCGAGAGGCGCTCGGCGTATACAATTGTTACGCCAACTCTAATAACTGCACGAGAGTAAACGAATATAAAACTATATACTCGGATCTGAGACGAGCTACTGAATCAAAAATTCTTGTTCCAACCGCTGGGACATTCCAACGAGGTGCGTCAGCGGCCATCTTGCAAAGCGTGCAACAAACTAAGCGAGTATCCCAATTATCGCCAATTATATATCTTAAGGCGCAGAAGAACCAAGCGGAGATAAAAGGTATGCGCAAAGCGCATTTACGCGACGCTGTAGCTATGAGCACGGTACTGAGTTATATGGAGGGCATGGGGAAGTCAAGTTTAACTGAGAAGTCAGTGGCTATGAAGGTAGATCTAACGCGGGCTACCCAGGCTGGTTACGTAGGTCTGTCGATGCAGACGCGGGTGTCTTTCGGACCTAATGGTGCTGAATCCGAGTATAAGGTCACTAACACATCCAATAGAAGAATATTCACCAATTCGACGCTCATCATACAGTCTGGAGGACAATACGACGAGGGCACGACCGTAGTTACCCGTACTCTACACTACGGCAACCCTACTCGTGACGAGCGTAAGGCTTACACGACTGTGCTGCGTTCCCTGTCTGCGCTGTCTATGCTGCAGACTCCGTCTACACTACCCGCCGCCCACGCTGATCCTCTCGCTAGGGCTCCGCTGTGGAATCACAAGCAAGACTACATCCCGCCAACGGGACACGGAGTTGGCGCCGCCCTAAACAGACGAGAAGATCCCGTTGTCATCGATTATCGTCAAGACACCAATCTACATCCATTCAGAGAAGGATACTTCGTCACAAGTGAGCCCGCATGGTACGAAGCTGGAAAATTTGGTGTTAAATTGGGCAATGTCTTAGAAGTGGTTCCGCGGTCTGCTGGATACCTAGGCTTCCGAGAGGCTACACTGTTGCCCTTCGAACCTAAACTGATTGATAAAAACCTACTCACTGAATACGAGATAAATTGGTTAAATACATACAACGAACGGATAAGGAAAACCGTGGGACCCGAATTAATGGACCAAGGACTAACAGACGTTTACTACTGGATGATGAACAAAACTATCAAAGTGGAATCACCGAGAGCCAAGAAATATACAAACTCAGCCTCTTCAACATATTTAAAAGTTCCTTTAGCTTTTGTCTTAGCAATTATTATTAACTACGTTTGA

Protein sequence:

>DPOGS208915-PA
MRILRKLVSSHWSLRPRLVRTNLEVAATGPYEALFELLDYIKKEEMLQKYEFNSCGYSADGMQAKVSKKECLSARVTPTTTIPSTPSSPTPATLLISQELPTVPQPPQICSEAANPNTPLSRLRIAMKNITQLNAFYDAFLVFFSDEHLSEEPAPEERRLEYISGWEGSGTAAVLADGGAAIWVPSGEVRRARDSLSCAWLVLDENDPSQPSIAEWISEHPGRNGRVGGDARLISAAEWHLLTASLSQEGLQLVHVPTLVDQLWDTEIDPAKRRPEFSRIVANLHNIDYTGVSWREKVQAVRNELRPLGSDAMVVTALDEVAWLLNIRGKDLPYAPLLKAFVVIGSKDVRVYAPAGKLSLPVREALGVYNCYANSNNCTRVNEYKTIYSDLRRATESKILVPTAGTFQRGASAAILQSVQQTKRVSQLSPIIYLKAQKNQAEIKGMRKAHLRDAVAMSTVLSYMEGMGKSSLTEKSVAMKVDLTRATQAGYVGLSMQTRVSFGPNGAESEYKVTNTSNRRIFTNSTLIIQSGGQYDEGTTVVTRTLHYGNPTRDERKAYTTVLRSLSALSMLQTPSTLPAAHADPLARAPLWNHKQDYIPPTGHGVGAALNRREDPVVIDYRQDTNLHPFREGYFVTSEPAWYEAGKFGVKLGNVLEVVPRSAGYLGFREATLLPFEPKLIDKNLLTEYEINWLNTYNERIRKTVGPELMDQGLTDVYYWMMNKTIKVESPRAKKYTNSASSTYLKVPLAFVLAIIINYV-