Monarch geneset OGS2.0

DPOGS202879
Transcript         DPOGS202879-TA   1104 bp
Protein            DPOGS202879-PA   367 aa
Genomic position   DPSCF300126 - 439709-443388
RNAseq coverage    442x (Rank: top 28%)
Annotation
Heliconius         HMEL014580         2e-94    73.01%
Bombyx             BGIBMGA004202-TA   1e-143   76.74%
Drosophila         Nup44A-PB          4e-109   54.38%
EBI UniRef50       UniRef50_Q96EE3    7e-120   61.63%   Nucleoporin SEH1 n=91 Tax=Opisthokonta RepID=SEH1_HUMAN
NCBI RefSeq        NP_001040420.1     0.0      81.48%   sec13-like protein [Bombyx mori]
NCBI nr blastp     gi|114051650       9e-180   81.48%   sec13-like protein [Bombyx mori]
NCBI nr blastx     gi|114051650       2e-178   81.48%   sec13-like protein [Bombyx mori]
Group
Gene Ontology      GO:0005515             3.4e-36   protein binding
KEGG pathway       bmy:Bm1_18455          1e-75
                   K01840 (E5.4.2.8, manB) maps->
                       Amino sugar and nucleotide sugar metabolism
                       Fructose and mannose metabolism
InterPro domain    [16-322] IPR015943     3.4e-36   WD40/YVTN repeat-like-containing domain
                   [19-320] IPR011046     4.7e-36   WD40 repeat-like-containing domain
                   [283-320] IPR019781    8.1e-09   WD40 repeat, subgroup
                   [281-320] IPR001680    2.5e-07   WD40 repeat
Orthology group    MCL13458               Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202879-TA
ATGAGTGAATTAGGTGGGTGTAATTTATTTGAATCACAAGCAATAGTTGCCGATCACAAAGACTTAATACATGATGTGGCTTACGACTTCTATGGGGAGAGAATGGCGACATGCTCTAGCGACCAGTATGTAAAGGTGTGGGATTCTGATGGGCAAGGTGGTTGGAAACTGACTGCAAGCTGGAAAGCACATCACGGCTCAGTGTGGAAAGTCACATGGGCACATCCTGAGTTTGGACAAGTTCTGGCTACTTGTTCCTTTGATAGGACAGCTGCTATATGGGAAGAAGTTGGTGACACAGCAGCATCGGGTACAGAGAAAGGGCTCAGGACTTGGGTGAAGAGATCAAATCTAGTGGATTCCAGGACTTCGGTCACAGATGTGAAGTTTGGGCCCAAGCATCTAGGGTTACTATTGGTGACATGTTCTGCTGATGGTATTATAAGGATATATGAAGCTCCCGATGTAATGAATTTAGCACAATGGACCTTGCAACATGAAATACCAACTAAGGTCTCTATCAGTTGTCTGTCGTGGAACCCATCATTATCAAGAAGTAGCAGTAACCCACCGATGTTGGCGGTGGGCAGCGACGAGCCCAGTGTTGCTGATAAAGCCAGTTCAGAACGAGTCTTCATATATGAGTACAGTGAATCCTCAAGGCGTTGGACCAGGACGGAGTGTTTGTCGTCTGTGGTGGAACCGGTCAATGACCTCGCCTTCGCGCCGAACCTCGGCCGCTCCTTCCACCTGCTCGCTGTGGCCACTAAAGACGTGAGGATCATCAAAATTGAACCGTTGCCTGAGTCTTCCGGTTCCGCTAACGGCAGCGTCCGCTTCAAGTCGGAAGTGTTGGCCGCCTTCGAGGAGCATTCGTCTTGTGTGTGGCGCGTCGCCTGGAACGTTACCGGGACCATGCTGGCGTCTTCCGGGGACGACTGCTGTATCAGGCTATGGAAGATGCAATACATGAACCAGTGGAAAGGTGTCGGTGTGTTCAAGAGTGAGGCGACTGGGGGAGAAGCGACCGCGCCGGCGCGTGCTCACACCACCTATACAAGACTGGCGCCCATGGCCAACCCAGCACACATGCCCTACCACTGA

Protein sequence:

>DPOGS202879-PA
MSELGGCNLFESQAIVADHKDLIHDVAYDFYGERMATCSSDQYVKVWDSDGQGGWKLTASWKAHHGSVWKVTWAHPEFGQVLATCSFDRTAAIWEEVGDTAASGTEKGLRTWVKRSNLVDSRTSVTDVKFGPKHLGLLLVTCSADGIIRIYEAPDVMNLAQWTLQHEIPTKVSISCLSWNPSLSRSSSNPPMLAVGSDEPSVADKASSERVFIYEYSESSRRWTRTECLSSVVEPVNDLAFAPNLGRSFHLLAVATKDVRIIKIEPLPESSGSANGSVRFKSEVLAAFEEHSSCVWRVAWNVTGTMLASSGDDCCIRLWKMQYMNQWKGVGVFKSEATGGEATAPARAHTTYTRLAPMANPAHMPYH-
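
Consistency check (illustrative sketch):

The transcript and protein records above can be cross-checked against each other. The sketch below is a minimal example, not part of the original record. It assumes the standard genetic code applies and that the trailing "-" in the protein sequence marks the stop codon. The function names `translate` and `lengths_consistent` are hypothetical helpers introduced here for illustration. Only short excerpts from the start and end of DPOGS202879-TA are used.

```python
from itertools import product

# Standard genetic code in the conventional TCAG codon order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def lengths_consistent(cds_bp: int, protein_aa: int) -> bool:
    """True if the CDS length equals three bases per residue plus one stop codon."""
    return cds_bp == 3 * (protein_aa + 1)


# First 30 and last 15 nucleotides of DPOGS202879-TA, copied from the record.
cds_start = "ATGAGTGAATTAGGTGGGTGTAATTTATTT"
cds_end = "ATGCCCTACCACTGA"

print(translate(cds_start))           # MSELGGCNLF
print(translate(cds_end))             # MPYH*
print(lengths_consistent(1104, 367))  # True
```

Both excerpts agree with the listed protein sequence, which starts with MSELGGCNLF and ends with MPYH followed by the stop marker, and the 1104 bp transcript length corresponds to 367 residues plus one stop codon.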