Monarch geneset OGS2.0

DPOGS202042
TranscriptDPOGS202042-TA1425 bp
ProteinDPOGS202042-PA474 aa
Genomic positionDPSCF300053 + 85557-88393
RNAseq coverage428x (Rank: top 28%)
Annotation
HeliconiusHMEL0117200.083.00% 
BombyxBGIBMGA001274-TA0.078.26% 
Drosophila% 
EBI UniRef50UniRef50_B7PB453e-12447.60%Transferrin receptor, putative n=2 Tax=Ixodes scapularis RepID=B7PB45_IXOSC
NCBI RefSeqXP_001951862.12e-13351.24%PREDICTED: similar to plasma glutamate carboxypeptidase [Acyrthosiphon pisum]
NCBI nr blastpgi|3214580197e-13651.83%hypothetical protein DAPPUDRAFT_189393 [Daphnia pulex]
NCBI nr blastxgi|910933791e-13254.34%PREDICTED: similar to predicted protein [Tribolium castaneum]
Group
Gene OntologyGO:00082333e-17peptidase activity
GO:00065083e-17proteolysis
KEGG pathwaymxa:MXAN_01004e-57 
 K01423 (E3.4.-.-)maps-> Biotin metabolism
    Lysine degradation
InterPro domain[286-450] IPR0074843e-17Peptidase M28
Orthology groupMCL11538 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202042-TA
ATGTTGGTTTTAATAAAATTGTGTTCGATTTTAATTTTATTAATTGGTGTACAAAGTAATCCAGACAGAAGAAAAAATTACTCGCAATTAAAATCATGTGATATCGGACCTTTGGCAGAAGAAATAGCATCCTACGAATCCGTGGTCAAAAACATTATAAACTACGTAGTGAACGGCCCGTTTAAAGGAAAAACATATGATGAATTATCGAAATTCGTCGATACTTTTGGTGCTCGTCCCTCAGGATCACAAATACTCGAAGACTCTATTGATTACATGATTCAGCTGACTAAGGACGAGGATATAAATGACATCGTTACGGAGGAACTAGAGGTACCACATTGGATGCGAGGAAAAGAAGAAATTACTATGATTGAACCACGAATAAAAAATATTGATTTATTAGGTTTAGGACAAAGTGTGAGCACACCATCTGAGGGTATTACCGCTGAGGTGATTGTGGTAAATAACTTCGAAGAGTTGGCTGAGATACCTAACGAAGTTGTTGAAGGTAAAATTGTGTTATACGACCCTATTTTTACGACATATCGTGAAACAGTTGTATATAGATCACAGGGTGCTGTTAGAGCAGCTGAAAAGGGAGCGGTTGCGTCATTAGTGAGGTCTATTGCACCATTCTCTATTAATTCACCTCATACTGGTTCACAAAATTATAATAATAATGTTAAAAAGATTCCAACTGCAGCCATTTCCATCGAAGATGCTGATTTAATGAGAAGACTGTTTAATAGAGGTCAAAAAATTATCTTAAATATAACAATGACGTCCACGTCTGAGACAAAAACCTCCAGGAATACGCTTATCGACCTAAAGGGAACTTTAAACCCAGAAAAGTTAGTTATTGTTTCTGGTCATATAGATAGTTGGGATGTTGGACAAGGTGCCATGGATGATGGTGGTGGCTTATTTGTAAGTTGGGCAGTACCAGTCATTTTGAAACAACTAAATATGAAACCAAAGAGAACTATAAGGTCTATATTTTGGACGGCTGAAGAGTTAGGATTAATTGGTGCTTACGCCTATGAGGAAAAACATAGAAATGAAAGTCATAACATAAATTTCATAATGGAATCCGATGAAGGTACATTCGCTCCACGTGGATTGGCTGTTGGTGGCAGTCAGAAAGCTCGATGTATTATAGCAGAAATTTTAAAACTATTCGAGTCTATAAATGCTTCTACTCTCGTAGAAGAAGACAGTCCGGGCTCTGATATTAGCGTTCTTATTAAAACCGGAATTCCAGGAGCCAGTCTTCATAATGCGAATGAAAAGTATTTTTGGTTTCATCACACGGAGGGAGATACTATGAATGTAGAAAGTCCTGAAGAACTTGATCTATGCGCGGCATTCTGGACTGCGGTGGCATATATAATAGCAGATATCTCTGCTGATATACCGCGTTAA

Protein sequence:

>DPOGS202042-PA
MLVLIKLCSILILLIGVQSNPDRRKNYSQLKSCDIGPLAEEIASYESVVKNIINYVVNGPFKGKTYDELSKFVDTFGARPSGSQILEDSIDYMIQLTKDEDINDIVTEELEVPHWMRGKEEITMIEPRIKNIDLLGLGQSVSTPSEGITAEVIVVNNFEELAEIPNEVVEGKIVLYDPIFTTYRETVVYRSQGAVRAAEKGAVASLVRSIAPFSINSPHTGSQNYNNNVKKIPTAAISIEDADLMRRLFNRGQKIILNITMTSTSETKTSRNTLIDLKGTLNPEKLVIVSGHIDSWDVGQGAMDDGGGLFVSWAVPVILKQLNMKPKRTIRSIFWTAEELGLIGAYAYEEKHRNESHNINFIMESDEGTFAPRGLAVGGSQKARCIIAEILKLFESINASTLVEEDSPGSDISVLIKTGIPGASLHNANEKYFWFHHTEGDTMNVESPEELDLCAAFWTAVAYIIADISADIPR-