Monarch geneset OGS2.0

DPOGS210634
TranscriptDPOGS210634-TA1491 bp
ProteinDPOGS210634-PA496 aa
Genomic positionDPSCF300168 + 657822-662511
RNAseq coverage30x (Rank: top 76%)
Annotation
HeliconiusHMEL0129155e-10555.88% 
BombyxBGIBMGA013576-TA4e-7534.41% 
DrosophilaCG15221-PB4e-4127.15% 
EBI UniRef50UniRef50_D6WB982e-5030.37%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WB98_TRICA
NCBI RefSeqXP_972569.14e-5130.37%PREDICTED: similar to SV2-like protein 1 [Tribolium castaneum]
NCBI nr blastpgi|371429383e-5629.30%SV2-like protein 1 [Ctenocephalides felis]
NCBI nr blastxgi|371429388e-5929.80%SV2-like protein 1 [Ctenocephalides felis]
Group
Gene OntologyGO:00550851.1e-22transmembrane transport
GO:00160211.1e-22integral to membrane
KEGG pathwayame:7244298e-41 
 K06258 (SV2)maps-> ECM-receptor interaction
InterPro domain[1-493] IPR0161961.6e-48Major facilitator superfamily domain, general substrate transporter
[35-305] IPR0117011.1e-22Major facilitator superfamily
Orthology groupMCL26096 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210634-TA
ATGGATCGTCAGGATGGGCGCAAGGTCACGTTCGAAGAAGCATTGAACGAAGCAGGTTTTGGGCTGTATAGCGTGGTCCTCCTCAGCCTGTCTGGTCTTATTATAATATCTCTGGTTTGTATAGCTTACGCCAGCACTATTATAGTTCCAGCTAGCGCTTGCGAGCTGGAAACTACGACCTCGCAAAAGGGACTTTTAGCGGCTGTTCCTGTCATTGCTTTACTACTGGGGGCTGTGCCATGGGGCTACCTCACAGATATCTACGGACGCAAGAGAATGCTTATCATCTTACTTTCCTCCTCAGCCGTATTTAACGGCTTAGCTTCTATATCCGTAAACTGGATCATGTTGTTAATCTTACAGTTTTTATCTACGTTTTTCTCATCGGGCCAGTTTTCTCAAGCTATGTCCATTCTTAGCGAAAGTGTTCCAATGGCTAATCGAAATATAGTAGTGTTGCTTGTTGGAAGTATTTTCTTACTCTCGCAGGGAATTATGGCATTGATGGCAATACCAATTATTCCTCTTTCTTTCTCGTATTACATGCCAACACTGGGCATATACTGGAACTCCTGGCGATCTCTTTTGCTTTTATATAGCGCTCCAAGCTTAATATCAATAATAGGCTTATTATTTATATCTGAAAGTCCAAAGTTCCTCTTTAACAAAGGTTTAGAGAAAGAGGCTTTGTCTGTTGTGAGTCGAATTCACAAAATAAATAATATATGGTCGAAAAAGGAATTTCCTGTAAGTAACATCCAAAGAGACAGACCACATACCACAGAGACTAAAACAGCGCTTACGGACACATTTAGTTCCTTGTTCAAGAGACCGTTACTTAAAAATACAATAATTCTATCTACCCTATTCTTATTTCAACAGGTGGGATCATTCGTGTTGTGGCTGCCGACAGTATCAAACCAATTCGTCCGTATTCTAGAGACCGGGGAGGGATCTGATCTCACACTCTGTGCAATAATACGTTCTAGTATTAACACACCACCAGACACTACCACCGCGCCCTGCGCCTTGAATGTCACCTCCTTATTGCTCGTGCTATCAGTTAGCGCACTACAATCTATTGCAAATGGCCTCATAAGTTTGATAATAAACCGGACAGGTCGTCGAAATGTGGTGATATGTGTAACAGCATTCTGCGGCATAGCGGGAATCCTGGTCAATTTCGTCCCAAACGCCATCGGCAGCGCTGTTCTTTACATCGTGTTCCTCATTGGGATTGTTGCTCTGGGCTTATACTCCGCGATCGCGGTCGCGTTGTTCCCAACCTATTTGAGGGCCTTGGCGGTTGCGTTTACTATGACAATTGGTCGTATAGGAACTTTTGTGTCAGTGCAAGTTCTAAATCGCTTACTAAACGATAACTGTGAAATTAGTTTCTATATTTACGGTGGTATATTCGCATCTTCAGCTATAGTTGCTGCCTTCCTAGTCGATGATCGTCAGTTGCAACCAAAAAAAATATTGGAATAG

Protein sequence:

>DPOGS210634-PA
MDRQDGRKVTFEEALNEAGFGLYSVVLLSLSGLIIISLVCIAYASTIIVPASACELETTTSQKGLLAAVPVIALLLGAVPWGYLTDIYGRKRMLIILLSSSAVFNGLASISVNWIMLLILQFLSTFFSSGQFSQAMSILSESVPMANRNIVVLLVGSIFLLSQGIMALMAIPIIPLSFSYYMPTLGIYWNSWRSLLLLYSAPSLISIIGLLFISESPKFLFNKGLEKEALSVVSRIHKINNIWSKKEFPVSNIQRDRPHTTETKTALTDTFSSLFKRPLLKNTIILSTLFLFQQVGSFVLWLPTVSNQFVRILETGEGSDLTLCAIIRSSINTPPDTTTAPCALNVTSLLLVLSVSALQSIANGLISLIINRTGRRNVVICVTAFCGIAGILVNFVPNAIGSAVLYIVFLIGIVALGLYSAIAVALFPTYLRALAVAFTMTIGRIGTFVSVQVLNRLLNDNCEISFYIYGGIFASSAIVAAFLVDDRQLQPKKILE-