Monarch geneset OGS2.0

DPOGS200170
TranscriptDPOGS200170-TA1062 bp
ProteinDPOGS200170-PA353 aa
Genomic positionDPSCF300128 + 360885-363497
RNAseq coverage242x (Rank: top 43%)
Annotation
HeliconiusHMEL0075546e-5948.83% 
BombyxBGIBMGA002920-TA8e-3531.34% 
DrosophilaTsp29Fb-PB2e-2426.42% 
EBI UniRef50UniRef50_Q16LT34e-3529.24%Tetraspanin 29fb n=2 Tax=Culicidae RepID=Q16LT3_AEDAE
NCBI RefSeqXP_001656134.18e-3629.24%tetraspanin 29fb [Aedes aegypti]
NCBI nr blastpgi|1571103721e-3429.24%tetraspanin 29fb [Aedes aegypti]
NCBI nr blastxgi|96243815e-4032.15%tetraspanin D76 [Manduca sexta]
Group
Gene OntologyGO:00160213e-31integral to membrane
KEGG pathwayspu:5937931e-17 
 K06497 (CD63, MLA1)maps-> Lysosome
InterPro domain[66-296] IPR0184993e-31Tetraspanin
[68-91] IPR0003011.6e-22Tetraspanin, subgroup
[165-269] IPR0089524.8e-10Tetraspanin, EC2 domain
Orthology groupMCL34339 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200170-TA
GATGCTTTCCTCATAAGTTTTAAAACGATAATAGATGCTTTCATCAAAACATTATCAGAAGCGATACATTTAATCGTGCCTCTACACAGCTTAGTGATCGTGACAATAGAGCCGTTTTCAGGAAGTGAGTTATTAGTGAGAAGAACAACTAAAATGAAATTCACGAAGACAGAGACGGAGTACAACATGAAATCTATAAGATTTCTGTTGTTGACCATCACCACAATGTTTATAATAATAGCTATTTTGATGATAGTTCTAGGATTTTCCGTGTACTCTCAATATCACAGTTTCACTTATTTCTACGATAGCACCAAGAATGGGATTGTCTTCACTCCTTCAGTCCTCAGTATTATACTCGGCATATTTTTGTTCGTAGTTACATTGTTTGGTTTCTTTGGCAGTTTGAAACAAAGCACATGCTTGGTCAATATGTACGCCCTTATCCTGACTTTATTGCTGATTTTGAAACTGGTTGTGGTTATACTAACATTCACATTAAAACCTGAAACATTGAAGAATTATATCTACATACCCGTCTCTAGCTACGTGTCAGACAAAGAGATTGAAATGGAAATTGATCGATTACAGATTACTCTCAATTGTTGCGGAGCTAATTCGTATTTAGACTACGTGGGTATGGACTTCACCAACCAGTCCACCGTGGTTATCACCACTCGGATAAACGGCGATGAAATGGAACTGATCGTACCAGAGAGCTGTTGTTCCCCGCGAGTTGAGTTCTGCACCGCCGCGAGGTCTAACAGTTGCAAGACAGCGATCATCAATCTGTTCGTCCAGAACGCTAGTGTCATCGGAGTGATGGGAATATCGGTTATGTTTATTCAAGTTCTCGGTATAATATTCGCACTCCTACTAGCGAGATGCATTCGGAAGATGAAAAGTGAGAGAACATTTCTCTCGTGGAAAATCAAAGAGCAGATGATTTTGGCGCGCGAAGAGGAGGAAAGCACAAAGGAAACTCAGGATACCGTCCGGGAACCAAACCCGGACCCCATGAGCGGCGTCTACATCCCTCCACACGACTGCAGCACTGCATAG

Protein sequence:

>DPOGS200170-PA
DAFLISFKTIIDAFIKTLSEAIHLIVPLHSLVIVTIEPFSGSELLVRRTTKMKFTKTETEYNMKSIRFLLLTITTMFIIIAILMIVLGFSVYSQYHSFTYFYDSTKNGIVFTPSVLSIILGIFLFVVTLFGFFGSLKQSTCLVNMYALILTLLLILKLVVVILTFTLKPETLKNYIYIPVSSYVSDKEIEMEIDRLQITLNCCGANSYLDYVGMDFTNQSTVVITTRINGDEMELIVPESCCSPRVEFCTAARSNSCKTAIINLFVQNASVIGVMGISVMFIQVLGIIFALLLARCIRKMKSERTFLSWKIKEQMILAREEEESTKETQDTVREPNPDPMSGVYIPPHDCSTA-