Monarch geneset OGS2.0

DPOGS204342
Transcript         DPOGS204342-TA   1158 bp
Protein            DPOGS204342-PA   385 aa
Genomic position   DPSCF300142 + 87747-93271
RNAseq coverage    19x (Rank: top 80%)
Annotation
Heliconius         HMEL002318         2e-70   61.41%
Bombyx             BGIBMGA007246-TA   8e-34   75.90%
Drosophila         CG8539-PA          4e-23   24.32%
EBI UniRef50       UniRef50_B4LE55    8e-22   25.51%   GJ12357 n=4 Tax=Drosophila RepID=B4LE55_DROVI
NCBI RefSeq        XP_002062161.1     1e-25   26.56%   GK16802 [Drosophila willistoni]
NCBI nr blastp     gi|195428198       2e-24   26.56%   GK16802 [Drosophila willistoni]
NCBI nr blastx     gi|195428198       5e-25   26.65%   GK16802 [Drosophila willistoni]
Group
Gene Ontology      GO:0006508   1.9e-31   proteolysis
                   GO:0008270   1.9e-31   zinc ion binding
                   GO:0004181   1.9e-31   metallocarboxypeptidase activity
KEGG pathway
InterPro domain    [59-331]   IPR000834   1.9e-31   Peptidase M14, carboxypeptidase A
Orthology group    MCL25328   Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204342-TA
ATGATTTTACAAGTGAAAGTATACCTAATTGCTGCAATCTATTCTTTAGGAATTTTGTTTATGATTTTATATCCGCCTCGCAACTATGAACAATCTGGAACTAAAATGAGAAAAATCAGAAGTGCTGGTAGCAGCATTAAGTATCGGAAGTACCTCGACTATGATACAGTGGTTAGTATTTTATCACGCATGGCGACTGAGGATTCCGAGATGGCTAATATTGTGCACCTAACTCCAATGACTTCTGCAAATAATAGTATAATTGCTCTTGAACTGCATAGCGATACACGGAGCAAGAAACCAGGCGTACTCGTCGTATCAGCTATAAATGGCATGGTTTGGGGAGGGCCTAATGCAGTTATTGAGTTAGCAGAGAAACTCATATATGATACAAACTATCAGACGCCATTTTTTAATGATTATGACTGGTATTTTATACCCTTAGCAAACCCCGATGGTTTGAACTTTACAACAAATATCCGCCATTTAACTCCATTGAATGCTATGGATTGGTCTCGGAATCTAACAGCAAGAAACGGCACAAGACCCGCATTTTGGCATAAGAATATGGAATCATCCTTTGATTCCTGTTTTGGAACTAACATTAACCGTAACTTTGCCTATCATTGGCAAGACGGGGTCTCAAAGAAGATGAACTGCTCTCAATTCTATCCTGGCTCGAAACCGTTCTCGACAGCCGAAACTCAAGCACTACGCTCGTATATAGACAAGTTGGCTGATGTAATAAACATCGCTATACATTTAGATGCAAGTTTTGTACCGAAAAAGGAATTCATTCTGTATCCCTGGCGGTATTCATTGCGTCAACCGAGCAACTTTCGTACATTGCAAGACATTGGAGAATATGCAGCGCGTCAGGCTAGATTACCTGACGGAAGACTTTACGAGGTTCACCAAAGCAGCAACGACGAGCGTGTTGCGGGATCACTAACTGACTATATATCTGGTGTCGTGGGTATTGAACTGGTCTTCCTTGTTAAGCCATATCACGAGATTTTTCCTAACTACACGGACAGTTTTATATTAGAAACCTACGTCAAGAAATCCATATCGGCTATTCTAAGTTTGGTTCGCGGTTGGAGGAGCAGCAACAAACAAAACACACTCTCCTTCTTCGGAGAAGACATTGAATTCTAA

Protein sequence:

>DPOGS204342-PA
MILQVKVYLIAAIYSLGILFMILYPPRNYEQSGTKMRKIRSAGSSIKYRKYLDYDTVVSILSRMATEDSEMANIVHLTPMTSANNSIIALELHSDTRSKKPGVLVVSAINGMVWGGPNAVIELAEKLIYDTNYQTPFFNDYDWYFIPLANPDGLNFTTNIRHLTPLNAMDWSRNLTARNGTRPAFWHKNMESSFDSCFGTNINRNFAYHWQDGVSKKMNCSQFYPGSKPFSTAETQALRSYIDKLADVINIAIHLDASFVPKKEFILYPWRYSLRQPSNFRTLQDIGEYAARQARLPDGRLYEVHQSSNDERVAGSLTDYISGVVGIELVFLVKPYHEIFPNYTDSFILETYVKKSISAILSLVRGWRSSNKQNTLSFFGEDIEF-
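
Consistency check (illustrative sketch):

The transcript and protein lengths in this record agree with each other: 385 codons plus one stop codon give 3 x 386 = 1158 bp. The Python sketch below shows one way to confirm that the nucleotide sequence translates to the listed protein. It assumes the standard genetic code and renders the stop codon as "-", as the protein record above does. The function name `translate` and its parameters are hypothetical and are not part of any geneset tooling.

```python
from itertools import product

# Standard genetic code, listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as `stop_symbol` to match the record's notation.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 30 nt and last 18 nt of DPOGS204342-TA, copied from the record above.
first_codons = "ATGATTTTACAAGTGAAAGTATACCTAATT"
last_codons = "GAAGACATTGAATTCTAA"

print(translate(first_codons))  # start of DPOGS204342-PA
print(translate(last_codons))   # end of DPOGS204342-PA, including the stop
```

Passing the full DPOGS204342-TA sequence to the same function and comparing the result with DPOGS204342-PA would extend the check to the whole record.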