Monarch geneset OGS2.0

DPOGS200486
Transcript: DPOGS200486-TA (1524 bp)
Protein: DPOGS200486-PA (507 aa)
Genomic position: DPSCF300158 - 120255-124537
RNAseq coverage: 195x (Rank: top 48%)
Annotation
Database         Hit                E-value   Identity   Description
Heliconius       HMEL012630         0.0       69.83%
Bombyx           BGIBMGA010422-TA   4e-94     63.39%
Drosophila       CG15221-PB         2e-56     29.00%
EBI UniRef50     UniRef50_D6WB98    9e-72     31.24%     Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WB98_TRICA
NCBI RefSeq      XP_313040.4        4e-76     34.36%     AGAP004156-PA [Anopheles gambiae str. PEST]
NCBI nr blastp   gi|158291535       7e-75     34.36%     AGAP004156-PA [Anopheles gambiae str. PEST]
NCBI nr blastx   gi|158291535       2e-76     34.36%     AGAP004156-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology:
  GO:0055085   1.1e-18   transmembrane transport
  GO:0016021   1.1e-18   integral to membrane
KEGG pathway:
  tca:662982   1e-54
  K06258 (SV2) maps to ECM-receptor interaction
InterPro domain:
  [10-493]   IPR016196   6e-53     Major facilitator superfamily domain, general substrate transporter
  [44-328]   IPR011701   1.1e-18   Major facilitator superfamily
Orthology group: MCL21153 (Lepidoptera specific)

Nucleotide sequence:

>DPOGS200486-TA
ATGGTGACAAAAATTAGCACAGTTTCACAGAACATTCACAGTGATAATGAAATTGATTTGGATCAAGCTTTGGACATCGCCGGCTTAGGTTGGTACAACATAAAATACAGCTTGGCATTAGCTCTATTTCTGATAGCAGCAATCATAGAACCCACCGGATACTCCTTCATACTTCCAGCAGCAAAATGCGATTTACAAATGACAGATTCACAAAGAGGAGTCATTGGTTCCATCCCGTATATTGGTGTGGTTGTGACGTCATTTGTTTGGGGTTACTTAACTGATACCAGGGGAAGAAAATATATGGTGATTTACAGTTCACTAGCAGCAGGAATATTCGGCCTCGCAGCATCGTTCATGCCCGAGATCATTAGTTTTACCATATTTAAATTTTTGTCTTCCTTGTGTATTGCTTGCCCGGCTGCAGTGCCATATAGTTTCATAGGAGAAATTTTACCGAAGAGATACAGAGATATAACATTGTCTATTACAAACGCGATGCAGATTACCGGATCCGCTGTTGTACCTTTATTGGCATGGGGAGTACTGCCGCTTGATTTCCGAACGGACTTCGGGTTGTATTATTATCGTCCGTGGCGACTTCTGGCCGCTTTGTATTCGTCATTTTTCATTATTAGTGCTATAATAATGAGTTTCGGTCCAGAGAGCCCCAAATATCTAATGTCACAAGGAAAACACGATGAATCATTACGAGTACTTCAAACGATATATGCGAGAAATAAAGGCAACGAAGCCAGTGACTATCCAGTTAAAAGACTCAAATTGCCAGATCAGAAATCAGACGGCAGACAATCTTTCCTGCTTTCCCTAAAGAATCAATCACTGCCATTATTGAAACCGCCATATTTAAAATGGTTGTGTCTTAATGGCGTACTGTTTTTTGGAATATTTGCAACCTTAAATGGACTTTACATGTGGCTGCCGGACGTTTTAAACCGCGTGTTCTCTGGAAAAAGCGTTGGTTTGACGGCTTGCGGCGTTATAAGGCAGCGATTAAACGAGACGTCGGGTTCAGTAACTGGTGAATGCGATGACTCCATTGATCCAATAACGTTTAAAATCAACACAATCGCAAACATATCTTGCGCTCTGATCGCGTTAGGCATCAGTAGTACAGTGAAATTCATCGGCAAAAAAGCGTTATTGATATCGGTCTATATAATTATTGGAGTATTTTGTATATTAAATAACTTTGTTACCGAAAATATGGTGTTCGCCGTACTGCTTTCGTCCGTACCAATAACTGGCTTGGCTATTGGACCTATAAATTCATACGCAGTGGAAATCTTTCCTACACATTTAAGAGGAATGGCTATAAGTCTATCGATGATGGTCGGACGGACTGGTTCTATAGTCGGCACTAATGTCGCTGGACTTCTCATCAACGCGGCATGTGAAGTCACTTTTTATTTGTTCGGAGGTCTTTTAGTATTGTGCGGTTTCCTCTCCTTTTTGCTTCCAACATCAAAATCAAAACCAAAGAACTCGATGACTGCCCTTTGA

Protein sequence:

>DPOGS200486-PA
MVTKISTVSQNIHSDNEIDLDQALDIAGLGWYNIKYSLALALFLIAAIIEPTGYSFILPAAKCDLQMTDSQRGVIGSIPYIGVVVTSFVWGYLTDTRGRKYMVIYSSLAAGIFGLAASFMPEIISFTIFKFLSSLCIACPAAVPYSFIGEILPKRYRDITLSITNAMQITGSAVVPLLAWGVLPLDFRTDFGLYYYRPWRLLAALYSSFFIISAIIMSFGPESPKYLMSQGKHDESLRVLQTIYARNKGNEASDYPVKRLKLPDQKSDGRQSFLLSLKNQSLPLLKPPYLKWLCLNGVLFFGIFATLNGLYMWLPDVLNRVFSGKSVGLTACGVIRQRLNETSGSVTGECDDSIDPITFKINTIANISCALIALGISSTVKFIGKKALLISVYIIIGVFCILNNFVTENMVFAVLLSSVPITGLAIGPINSYAVEIFPTHLRGMAISLSMMVGRTGSIVGTNVAGLLINAACEVTFYLFGGLLVLCGFLSFLLPTSKSKPKNSMTAL-