Monarch geneset OGS2.0

DPOGS206858
Transcript: DPOGS206858-TA, 495 bp
Protein: DPOGS206858-PA, 164 aa
Genomic position: DPSCF300001 - 2836929-2837423
RNAseq coverage: 198x (Rank: top 47%)
Annotation
Heliconius: HMEL006145, 1e-82, 84.76%
Bombyx: BGIBMGA012810-TA, 4e-83, 84.76%
Drosophila: CG11110-PA, 7e-64, 65.64%
EBI UniRef50: UniRef50_E3X4V9, 3e-67, 72.05%, Putative uncharacterized protein n=2 Tax=Coelomata RepID=E3X4V9_ANODA
NCBI RefSeq: XP_972321.1, 7e-66, 69.33%, PREDICTED: similar to AGAP007398-PA [Tribolium castaneum]
NCBI nr blastp: gi|312375717, 1e-66, 72.05%, hypothetical protein AND_13787 [Anopheles darlingi]
NCBI nr blastx: gi|312375717, 1e-65, 72.05%, hypothetical protein AND_13787 [Anopheles darlingi]
Group
Gene Ontology:
  GO:0016020, 3.8e-13, membrane
  GO:0006508, 3.8e-13, proteolysis
  GO:0008236, 3.8e-13, serine-type peptidase activity
KEGG pathway: tca:661039, 2e-65
  K09648 (IMP2) maps-> Protein export
InterPro domain:
  [21-149] IPR015927, 3.7e-35, Peptidase S24/S26A/S26B/S26C
  [27-151] IPR011056, 1.1e-28, Peptidase S24/S26A/S26B/S26C, beta-ribbon domain
  [11-93] IPR000223, 3.8e-13, Peptidase S26A, signal peptidase I
  [29-101] IPR019759, 4.3e-09, Peptidase S24/S26A/S26B
Orthology group: MCL10628, Multiple-copy universal gene

Nucleotide sequence:

>DPOGS206858-TA
ATGTTTCTCAAAAGTTTGTGCAAATCTATCCTTTTTGGAGTACCCATAGGCATAACCTTCCTTGACACAGTGGGATATGTCGCACGAGTGGAAGGAATTTCCATGCAACCAGTTCTCAATCCCGGAACCAAGAATACCGATTACGTATTTTTGTCGCGATGGTCAGTAAGAGATTATCAAGTTAAAAGAGGTGATGTTATCTCACTGGTGTCACCTAAAGACCCTAATCAGAAGATTATAAAGCGAGTCGTAGCTCTGGAAGGTGATGTTGTAAATACACTTGGTTACAAGAATCAATATGTAAAAATCCCTGAAGGCCACTGTTGGGTTGAAGGTGATCACACAGGTCATACATTGGATAGCAACACATTTGGTCCGGTATCATTAGGTCTGATTAATGCTAAAGCCCTTTGCATAGTTTGGCCACCCAGCAGGTGGCAGAATTTGGAAGCTAAGCTGCCAAACAACAGAACTCCTATCAGTAAATCAATATGA

Protein sequence:

>DPOGS206858-PA
MFLKSLCKSILFGVPIGITFLDTVGYVARVEGISMQPVLNPGTKNTDYVFLSRWSVRDYQVKRGDVISLVSPKDPNQKIIKRVVALEGDVVNTLGYKNQYVKIPEGHCWVEGDHTGHTLDSNTFGPVSLGLINAKALCIVWPPSRWQNLEAKLPNNRTPISKSI-
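
The lengths in this record can be cross-checked against each other. The following is a minimal sketch, not part of the database itself. It assumes that the genomic coordinates are inclusive, that the transcript is translated with the standard genetic code, and that the trailing "-" in the protein sequence marks the stop codon. The function names (`span_length`, `expected_cds_length`, `translate`) are hypothetical helpers introduced only for illustration.

```python
# Sanity checks for the record's reported lengths (illustrative sketch).
# Assumptions: inclusive genomic coordinates, standard genetic code,
# and "-" used as the stop symbol, as in the protein sequence above.

BASES = "TCAG"
# Standard genetic code in TCAG order; "*" marks stop codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming both ends are inclusive."""
    return end - start + 1


def expected_cds_length(protein_aa: int) -> int:
    """Coding length in bp for a protein of protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon; unknown codons become 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # Values copied from the record above.
    print("genomic span (bp):", span_length(2836929, 2837423))
    print("expected CDS length (bp):", expected_cds_length(164))
    # First eight codons and last four codons of DPOGS206858-TA.
    print("N-terminus:", translate("ATGTTTCTCAAAAGTTTGTGCAAA"))
    print("C-terminus:", translate("AAATCAATATGA"))
```

Under these assumptions the genomic span equals the 495 bp transcript length, which is what a transcript with no introns would look like, and 164 residues plus a stop codon account for all 495 bp. Passing the full nucleotide sequence to `translate` gives a string that can be compared directly with the DPOGS206858-PA entry.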