Monarch geneset OGS2.0

DPOGS214950
TranscriptDPOGS214950-TA942 bp
ProteinDPOGS214950-PA313 aa
Genomic positionDPSCF300280 + 71780-72805
RNAseq coverage703x (Rank: top 18%)
Annotation
HeliconiusHMEL0155872e-17896.17% 
BombyxBGIBMGA004847-TA2e-17292.01% 
Drosophilasec13-PA2e-11364.78% 
EBI UniRef50UniRef50_P557351e-11265.37%Protein SEC13 homolog n=73 Tax=Eukaryota RepID=SEC13_HUMAN
NCBI RefSeqXP_002429091.14e-13672.93%protein transport protein sec13, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420172238e-13572.93%protein transport protein sec13, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420172237e-14173.16%protein transport protein sec13, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00055152.3e-58protein binding
KEGG pathwayphu:Phum_PHUM4200101e-135 
 K14004 (SEC13)maps-> Protein processing in endoplasmic reticulum
InterPro domain[10-294] IPR0159432.3e-58WD40/YVTN repeat-like-containing domain
[10-285] IPR0110466.2e-46WD40 repeat-like-containing domain
[52-87] IPR0197814.9e-09WD40 repeat, subgroup
[46-87] IPR0016803.6e-06WD40 repeat
Orthology groupMCL13496 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214950-TA
ATGATTAGTGTTTTAAATACTATTGACACTGGCCATGAAGACATGATACACGATGCAGAGTTGGATTACTATGGACTAAGATTAGCGACATGTTCTTCGGATAACTCTGTTAAAATTTACGACATCAAAAGCGGTACACAAACTCTTGCCGCAGACTTAAAAGGACACTTTGGACCGGTGTGGCAGGTTGCTTGGGCCCATCCAAAGTTTGGTAATCTACTGGCTTCTTGTTCATACGACAGAAAAGTGATCATATGGAAAGAATCCGGAGAATGGACCAAGCTTTACGAGTACTCTGGTCATGAAAGTTCTGTGAATTCAGTGGCATGGGCGCCGGAAGAATACGGATTGATACTGGCATGTTGTAGTTCAGATGGCTCAATTTCTACCATCACTTACAATCAAGATGGAGGAAATTGGGATGTTAAAAAGATACCTGGTGCCCATGCCATTGGAGTTAATTCTATCAGTTGGTGTCCAGCAATATCTGCCGATCTTCATTTAGATCCCCTTACCAACAAAGATGCACCTAAAAGGATAGTATCTGGAGGATGTGATAACTTAATTAAGATTTGGAAGGAGCAAGGAGATCAGTGGATTGAAGAGAACCGTTTGGAAATGCACATGGATTGGGTGCGGGATGTTGCCTGGGCCCCATCTCTTGGCTTGCAACGTTCTATGATTGCCAGTTGCTCTCAGGATAAAAGAGTTGTAATATGGTCCAGTGATGATAATGTGTCTTGGAGTCCAACTATCCTCAATACATTTGACGATGTCATCTGGAGTGTCAGTTGGTCTTTAACAGGAAACATACTAGCGGTATCCGGAGGAGACAATAAAGTCAGCCTTTGGAGAGAAAATGCCGATGGACAGTGGTTATGTATAAGTGAAGTAGCGAAAGGCCTGGGTCAGGCACCCAATGAAGAGAGGAGCACACTTTGA

Protein sequence:

>DPOGS214950-PA
MISVLNTIDTGHEDMIHDAELDYYGLRLATCSSDNSVKIYDIKSGTQTLAADLKGHFGPVWQVAWAHPKFGNLLASCSYDRKVIIWKESGEWTKLYEYSGHESSVNSVAWAPEEYGLILACCSSDGSISTITYNQDGGNWDVKKIPGAHAIGVNSISWCPAISADLHLDPLTNKDAPKRIVSGGCDNLIKIWKEQGDQWIEENRLEMHMDWVRDVAWAPSLGLQRSMIASCSQDKRVVIWSSDDNVSWSPTILNTFDDVIWSVSWSLTGNILAVSGGDNKVSLWRENADGQWLCISEVAKGLGQAPNEERSTL-