Monarch geneset OGS2.0

DPOGS204440
Transcript: DPOGS204440-TA (1413 bp)
Protein: DPOGS204440-PA (470 aa)
Genomic position: DPSCF300002 - 27668-33296
RNAseq coverage: 91x (Rank: top 63%)
Annotation
Heliconius      HMEL017407        3e-166  67.48%
Bombyx          BGIBMGA013574-TA  5e-118  48.39%
Drosophila      CG15221-PB        1e-37   26.46%
EBI UniRef50    UniRef50_D6WB98   2e-49   29.84%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WB98_TRICA
NCBI RefSeq     XP_001843449.1    2e-50   28.90%  synaptic vesicle protein [Culex quinquefasciatus]
NCBI nr blastp  gi|170031149      3e-49   28.90%  synaptic vesicle protein [Culex quinquefasciatus]
NCBI nr blastx  gi|170031149      1e-52   28.82%  synaptic vesicle protein [Culex quinquefasciatus]
Group
Gene Ontology    GO:0055085  7.4e-21  transmembrane transport
                 GO:0016021  7.4e-21  integral to membrane
KEGG pathway     cqu:CpipJ_CPIJ011542  8e-35
                 K06258 (SV2)  maps-> ECM-receptor interaction
InterPro domain  [1-460]  IPR016196  7.9e-40  Major facilitator superfamily domain, general substrate transporter
                 [4-278]  IPR011701  7.4e-21  Major facilitator superfamily
Orthology group  MCL26091  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204440-TA
ATGGGAACTGACATGTTCGGCTTCGGGCTGGTTGTGGCTGCGGCTTGCGACCTCAACATCACAGTCACTCAGAAAGGGATACTTACATCACTGCCGTTTATTGGTATCCTGCTAGTATCGTATGTATGGGGATACATATCAGACACTCGCGGCAGGCGCTTCTCCCTCATCATCTCCGTCCAGGCTGGATTCCTCCTCTCCTGCCTCGCCAGCATCACCACCAATTGGTTATTCCTGGCCTTCGTTAAGTTCTTTTCAGTTTGTTTTTCCTGTGCTATCAACTCCGTAGCGTACACCCTGGTGGGGGAGTCGTGTGTTCAGCGAGTGAGAAGCAAGTACATGTTAATGATGACCTGCTTCGTCGTGCTGTCTCCTGGAATCGCTGCCCTGCTGACATACCCAACTCTGTCGCTGAACTTCGAAAGCGATGTACCGTTTTTAGGGATCAAGTTTACTCCATGGCGTCTCCTTATGATAGTGCTCGCGATACCCATGGGAATTGGGGGGGTTATCATCTGCTATTTTCATGAATCCCCCAAGTTTCTGATAAATGTCGACAGACAGGACGAGGCATTAGAAGTCTTGAGGTCTATCTATGCGATTAACAACAGGAACTCTAGTGTCAAGAGTGATTTACAGGTAAAATCTATATACCTAGAAGACTCGGTGTCCACGAAGGAATATTCGTTATTGAAGAAAGCCTACGAGCAGAGCGCACCTCTGTTCCGACCACCGCTCCTGTGGCGAACCTGTCAGCTGTTCTTCATCGTCTCAGTTATATATTTTACAAATAACAGTCTCCTGGTGTGGCTTCCGTACGTTCTGAATATGTTGAAAGTGACCCAATCACAGACAGGTTACACTAAAGGTGACGTCTGCACCCTCATCTCCGTCAAACCTGAACCTGACAATTTTACAATGGAAGCTAATCATACTAAGATTACCCCAGACATTTGTTTGGGTTATATTGAGGACAACGTGATCATAACCCTCGTGGCGTCAGCGGTGGTCTATTCGTTCTTAAACTTCGCGCTGTCCTACCTCATGAGGTGGAGACGCCTGGTGTTGATTTCCATCCTCGTTATATCTGCCCTGAGTGGAGCTCTGTTGAATCTGATGCCGGAGCCTATCTCCAGCGTGTTTCTATTTATGGCTTTCTCTTGCACCAATCAGGCCATGGGCATCATGGCGGCTTACTTCGTGGAATTCTATCCCACATCTTGCAGGGGCATGGCTTGTTGTCTCAGTATCATGGTGGGGCGAACGAGCACGTTTGTGGGTATCAACGTGGTTGGTAACCTCATCTTCCACCACTGCCACATCACATTCTACATATGGTCTCTGGTGGTGTTCAGTAGTGCGATTGCCGCGTGGTTCCTACCTCACGACAAACCGCTGCAAAGCCGAACATGA

Protein sequence:

>DPOGS204440-PA
MGTDMFGFGLVVAAACDLNITVTQKGILTSLPFIGILLVSYVWGYISDTRGRRFSLIISVQAGFLLSCLASITTNWLFLAFVKFFSVCFSCAINSVAYTLVGESCVQRVRSKYMLMMTCFVVLSPGIAALLTYPTLSLNFESDVPFLGIKFTPWRLLMIVLAIPMGIGGVIICYFHESPKFLINVDRQDEALEVLRSIYAINNRNSSVKSDLQVKSIYLEDSVSTKEYSLLKKAYEQSAPLFRPPLLWRTCQLFFIVSVIYFTNNSLLVWLPYVLNMLKVTQSQTGYTKGDVCTLISVKPEPDNFTMEANHTKITPDICLGYIEDNVIITLVASAVVYSFLNFALSYLMRWRRLVLISILVISALSGALLNLMPEPISSVFLFMAFSCTNQAMGIMAAYFVEFYPTSCRGMACCLSIMVGRTSTFVGINVVGNLIFHHCHITFYIWSLVVFSSAIAAWFLPHDKPLQSRT-
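
Length check:

The lengths reported in this record can be cross-checked with a little arithmetic. The sketch below is a minimal illustration and not part of the record: the function names `coding_length_check` and `genomic_span` are hypothetical, and it assumes that the transcript entry is coding sequence only, ending in a single stop codon (the trailing `-` of the protein sequence), and that the genomic coordinates are 1-based and inclusive.

```python
def coding_length_check(transcript_bp: int, protein_aa: int) -> bool:
    """Return True when the transcript length equals three bases per
    residue plus one stop codon (assumption: CDS-only transcript)."""
    return transcript_bp == 3 * (protein_aa + 1)


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record: 1413 bp transcript, 470 aa protein,
# scaffold coordinates 27668-33296.
print(coding_length_check(1413, 470))  # True
print(genomic_span(27668, 33296))      # 5629
```

Under these assumptions the transcript and protein lengths agree (3 x 471 = 1413), and the genomic interval (5629 bp) is longer than the transcript.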