Monarch geneset OGS2.0

DPOGS204410
TranscriptDPOGS204410-TA1281 bp
ProteinDPOGS204410-PA426 aa
Genomic positionDPSCF300002 - 679189-683883
RNAseq coverage249x (Rank: top 42%)
Annotation
HeliconiusHMEL0062728e-13672.84% 
BombyxBGIBMGA007716-TA4e-15581.48% 
Drosophilaarg-PA5e-6742.36% 
EBI UniRef50UniRef50_Q2F6C61e-14980.25%Arginase n=3 Tax=Bombyx mori RepID=Q2F6C6_BOMMO
NCBI RefSeqNP_001040105.12e-15080.25%arginase [Bombyx mori]
NCBI nr blastpgi|2221435563e-15281.48%arginase [Bombyx mori]
NCBI nr blastxgi|2221435561e-15181.48%arginase [Bombyx mori]
Group
Gene OntologyGO:00065251.7e-122arginine metabolic process
GO:00468721.7e-122metal ion binding
GO:00040531.7e-122arginase activity
GO:00168131.7e-122hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in linear amidines
KEGG pathwaytca:6553666e-105 
 K01476 (E3.5.3.1, rocF, arg)maps-> Amoebiasis
    Arginine and proline metabolism
InterPro domain[1-328] IPR0140331.7e-122Arginase, subgroup
[1-328] IPR0060351.7e-122Ureohydrolase
[11-313] IPR0236964.5e-84Ureohydrolase domain
Orthology groupMCL15078 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204410-TA
ATGAGTCAACAAACAAAATGTCAACCCTTAAAAAGGGTGGGGTTGATCGGTGTTCCGTTTGAGAAAGGCCAAAAAAAGTATGGAGTCAGTATAGCACCCGCCGCTCTAAGATCAGCTGGGTTGATTGAACGCTTAAAGGACATCGATGGTTTAGATGTGAAAGACTATGGAGACATCGAGGTCCCGTCGTCCGAAAGGCCTGTAGATGTGGACAATATGGCTCACCTTCCACTTGTGTCAGCCTGCAACAAAAACCTATCAGACAAAGTATCACAAGTTCTAAAAGACGGTAGAGTTGCTGTTACCATAGGTGGAGATCATTCTATTGGAGTTGGAACGGTCGACGGGCATTATAAAGTAAACGAGAACATGATCCTTATTTGGGTAGACGCTCATGCTGACATCAACACTAACAAGACTTCCGAATCGGGTTCCGTCCATGGCATGCCAGTAGCTTTACTTGTTAAAGAATTATCTGACTACTGGCCTTATCTCCCAACCATGGACTGGCAAGTCCCAAAATTTTCGATAAAGAATCTCGGATACATTGGCCTTAGATCAGTAGACAAGTACGAAAGGCTGGCAATAGAAAAATACGACGTGCCTGCGTTCGCAATGGAAGACATAGAGGATTACGGAATTCATAAATCCATAGACCACGTTCTGCAGAGGCTAGACCCCAAAGGAAATAAACCGATCCACGTCAGTTTCGACATCGATTCATTGGATTCTTTAGAGGCTCCCAGTACGGGCACCCCTGTTCGAGGAGGTCTTACACTTCGGGAAGCTATCAAATTGATGGAGATTATTCACGCAACTGGCCGTCTCCGGGCATTTGACCTTGTTGAGATTAACCCAGCTCTTGGTAATGACTCTGATAGGAAGAGAACTATCGAAGCTGGCATGAGCGTGATGATGGCAGCCTTAGGATTCTCGCGACGTGGGATGACACCGCGTGGGATTTTAGCCCTAAATTGCATATTCTGTGAAACGGATCCTAAACAAAATGACATGATTGATGTAGAAAAAGTATTCGAAGAATATTTCTCAAAATTACCGAATAACCAAAGTAAATACGTTATAAACGAAGGGATATCGAAAATGATTTCAAGATATTTTGATGGAAAAGTGAACGCGTTCCCATTGTTTAAAGTAAGTATTAACATCGACCGTAAGCCCGTCAAGATACCCTGGACTTTGGACAAAGCAGATAAATTAAATGAAACCAAATTAAAAAAGAGGTACCAGTACGTTAAGATGGAAACAACTAAAAATGAATAA

Protein sequence:

>DPOGS204410-PA
MSQQTKCQPLKRVGLIGVPFEKGQKKYGVSIAPAALRSAGLIERLKDIDGLDVKDYGDIEVPSSERPVDVDNMAHLPLVSACNKNLSDKVSQVLKDGRVAVTIGGDHSIGVGTVDGHYKVNENMILIWVDAHADINTNKTSESGSVHGMPVALLVKELSDYWPYLPTMDWQVPKFSIKNLGYIGLRSVDKYERLAIEKYDVPAFAMEDIEDYGIHKSIDHVLQRLDPKGNKPIHVSFDIDSLDSLEAPSTGTPVRGGLTLREAIKLMEIIHATGRLRAFDLVEINPALGNDSDRKRTIEAGMSVMMAALGFSRRGMTPRGILALNCIFCETDPKQNDMIDVEKVFEEYFSKLPNNQSKYVINEGISKMISRYFDGKVNAFPLFKVSINIDRKPVKIPWTLDKADKLNETKLKKRYQYVKMETTKNE-