Monarch geneset OGS2.0

DPOGS201086
TranscriptDPOGS201086-TA1257 bp
ProteinDPOGS201086-PA418 aa
Genomic positionDPSCF300185 + 201740-205677
RNAseq coverage342x (Rank: top 34%)
Annotation
HeliconiusHMEL0223151e-11176.25% 
BombyxBGIBMGA006965-TA8e-10275.11% 
Drosophilapyd3-PA8e-8658.73% 
EBI UniRef50UniRef50_Q9UBR17e-8458.08%Beta-ureidopropionase n=60 Tax=Eukaryota RepID=BUP1_HUMAN
NCBI RefSeqXP_001948310.12e-8861.35%PREDICTED: similar to AGAP010229-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|3320281465e-8962.11%Beta-ureidopropionase [Acromyrmex echinatior]
NCBI nr blastxgi|3320281465e-8762.11%Beta-ureidopropionase [Acromyrmex echinatior]
Group
Gene OntologyGO:00068072.3e-53nitrogen compound metabolic process
GO:00168102.3e-53hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
GO:00516036.3e-12proteolysis involved in cellular protein catabolic process
GO:00042986.3e-12threonine-type endopeptidase activity
GO:00058396.3e-12proteasome core complex
KEGG pathwayapi:1001623305e-88 
 K01431 (E3.5.1.6)maps-> Pantothenate and CoA biosynthesis
    Drug metabolism - other enzymes
    Pyrimidine metabolism
    beta-Alanine metabolism
InterPro domain[11-236] IPR0030102.3e-53Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase
[243-309] IPR0013536.3e-12Proteasome, subunit alpha/beta
Orthology groupMCL30468 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201086-TA
ATGCCATTCTTCCTTTGCACGAAAGAGAAAAACAAGTGGGCGGATTTCGCTGAATCAGCCGTCGATGGACCAAGCACGCTTTTTCTTAGTAAATTAGCGAAAAAGTATGAAATGGTCATAGTGTCTCCTATATTAGAGAGTGATTCTGCCGAGTGGTGGAATACAGCCGTTGTGATAGATGAAGAAGGAAATTATTTGGGAAAACATCGAAAAAACCACTTGCCCAGTGTTGGGAGTTTCAGTGAGACTGCTTATTATGCACCTGGCAACACCGGTCATCCTGTTTTCAAAACTAAATACGCGAATATCGCCGTAAACATATGTTATGGTCGCCATCACGCATTAAATTGGTTAATGTTTGCTGAAAACGGTGCACAGATAGTATTTAATCCTTCAGCGACTGTAACTGATTTTGGACAATCGTTTTGGGATATAGAAGCTCGTAATGCTGCAATAGCAAATAGTTTTTTCACTTGTAGTATTAATAGGGTTGGGACGGAAGAATTTACGGTGAAAAATGAAACCAAAACAAGAACATACTACGGATCTTCTTATGTTACTGCACCTGACGGTTGTAGAACACCATGTTTACCGAATGAAAAGGATGGCCTTCTCATCACTGAAATAGATTTGAATTTATGCCGGCAAGTTAAAGATAAATGGGGTTTCCGAATGACAGCCCGATTAGATATGTATGCAAAAGAAATCAATGAATCAGTTGAAACTTATAATGGAGGTGCCGTGGTCGCCATGAAGGGGCAAGATTGTGTTGCTATAGCCACGGACAAGCGCTATGGTATCCAGGCTCAAACTGTCTCCACCAACTTTCCAAAAGTATACCAGATGGGACCCACCCTGTTCGTAGGCCTTCCAGGTCTTGCTACTGACACCCAAACAGTTTTTCAGAGACTTAAATTCAGGGCTGCGGTAGGCACAACTACTCGGTGCAAATTTTCCACAAAAAAATTCTCAAGGACGATCTACAAATCTCGTGTTGTATGCAGAAATGTTTGCATGTGTGTATGCGAAAATCTATGTGTGGGTTGTATTCGAGAAGCACTGCATATTTTTTTTGGTTTAAAATTTCGTAGAAATTCCGTTAATTGTCAAATGAAACGGGGCTCTCTGATTGGCCGCACGTTCCAGGCCCTGGTGAATGCGGCCGACCGCGATGCCATCTCCGGCTGGGGAGCGGTCGTCTACATCATAGAGAAGGACAAGATCACAGAGAAACATGTCAAGACCAGAATGGATTAG

Protein sequence:

>DPOGS201086-PA
MPFFLCTKEKNKWADFAESAVDGPSTLFLSKLAKKYEMVIVSPILESDSAEWWNTAVVIDEEGNYLGKHRKNHLPSVGSFSETAYYAPGNTGHPVFKTKYANIAVNICYGRHHALNWLMFAENGAQIVFNPSATVTDFGQSFWDIEARNAAIANSFFTCSINRVGTEEFTVKNETKTRTYYGSSYVTAPDGCRTPCLPNEKDGLLITEIDLNLCRQVKDKWGFRMTARLDMYAKEINESVETYNGGAVVAMKGQDCVAIATDKRYGIQAQTVSTNFPKVYQMGPTLFVGLPGLATDTQTVFQRLKFRAAVGTTTRCKFSTKKFSRTIYKSRVVCRNVCMCVCENLCVGCIREALHIFFGLKFRRNSVNCQMKRGSLIGRTFQALVNAADRDAISGWGAVVYIIEKDKITEKHVKTRMD-