Monarch geneset OGS2.0

DPOGS202228
TranscriptDPOGS202228-TA1197 bp
ProteinDPOGS202228-PA398 aa
Genomic positionDPSCF300149 + 428478-451447
RNAseq coverage749x (Rank: top 17%)
Annotation
HeliconiusHMEL0091806e-7893.96% 
BombyxBGIBMGA012097-TA3e-11184.03% 
DrosophilaCG1358-PB3e-14862.84% 
EBI UniRef50UniRef50_Q7K1D74e-14662.84%CG1358, isoform A n=20 Tax=Pancrustacea RepID=Q7K1D7_DROME
NCBI RefSeqXP_002050338.12e-14762.84%GJ20266 [Drosophila virilis]
NCBI nr blastpgi|1953832483e-14662.84%GJ20266 [Drosophila virilis]
NCBI nr blastxgi|1582948648e-14964.75%AGAP005839-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00550856.5e-30transmembrane transport
GO:00160216.5e-30integral to membrane
KEGG pathway 
InterPro domain[1-396] IPR0161961.6e-48Major facilitator superfamily domain, general substrate transporter
[6-350] IPR0117016.5e-30Major facilitator superfamily
Orthology groupMCL11812 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202228-TA
ATGCAGTGGCTGCAGTACAGCATCATACAGGACGCGGTCGTGAAGCACTATGGAGTCACCAGCATCACGGTCTACTGGACCTCGATGGTGTACATGATCACCTACATCCCGCTCATATTCCCAGCCAGCTACTTGCTCGATAAGACTGGTTTGAGGACAACGACGATAATAGGATCGTTCGGGACCTGTGCTGGAGCATGGTTGAAGGTGTTCTCTGTGCCACCGGACATGTTCTGGCTGGGCTTCGCTGGACAGACCGTGGTGGCGGTGGCGCAGGTCTTCATACTGAACGTACCACCGAGACTGGCAGCCGTCTGGTTCGGAGCTGACCAGGTCTCCTCGGCCTGTAGCATCGGGGTTTTTGGTAATCAGCTGGGTGTTGCCCTGGGATTCCTTCTCCCTCCTATGTTGGTGAGGTCCACGGGAACCCCCGAGGAGATCGGAGCTGATCTCCAGATGATGTTCTACTTGGTAGCAGGAGCCACCAGCGTCCTCTTTGTGTTCATCGTGCTATTCTTCAAGTCTGCTCCTCCCACGCCTCCGTCAGCGGCGGCCGACCTCGGTTCCAGCCTGGACTCCAACTTCCTCCAGTCCATCAAGAAGCTCCTCACCAATCGCAACTACATCCTGCTGCTCATCTCATACGGCCTCAACGTTGGCGTGTTTTACGCCATCTCCACGCTCCTCAACGAGCTCGTCCTTACATACTATCCAGGTGCCAACGAGGACGCGGGTCGGATCGGTTTGGTGATAGTGGTAGCGGGTATGGTGGGGTCCGTGGTGTGCGGCCTGGTGCTGGACAAGACTCACCGCTTCAAGGAGACCACGCTGGCGGTGTACGCGGCCTCCGTCCTGGGCATGCTGGTGTTCACCTTCACCTTGGACTGTGGATACATCGCCGTCGTCTACTTCAGCAGCATCGTGCTCGGTTTCTTCATGGCCGGCTATTTACCAGTCGGCTTCGAGTTCGCGTCTGAGGTCACTTACCCCGAACCCGAGGGTACTACATCTGGGATACTCAATGCTTGTGTTCAGGTTTTCGGTATAGTCCTGACTCTGCTGTTTGAGTGGATGTTGGGTGCTGCCGGAGACCGCTGGGCCAACCTGTGTCTGTGTGGTCTTCTGGCACTGGGAACCGCGGTAACGGCCGCCATCAGGTCTGACCTCCGACGACAGGCCGCTCAGAACAAGGCCTAG

Protein sequence:

>DPOGS202228-PA
MQWLQYSIIQDAVVKHYGVTSITVYWTSMVYMITYIPLIFPASYLLDKTGLRTTTIIGSFGTCAGAWLKVFSVPPDMFWLGFAGQTVVAVAQVFILNVPPRLAAVWFGADQVSSACSIGVFGNQLGVALGFLLPPMLVRSTGTPEEIGADLQMMFYLVAGATSVLFVFIVLFFKSAPPTPPSAAADLGSSLDSNFLQSIKKLLTNRNYILLLISYGLNVGVFYAISTLLNELVLTYYPGANEDAGRIGLVIVVAGMVGSVVCGLVLDKTHRFKETTLAVYAASVLGMLVFTFTLDCGYIAVVYFSSIVLGFFMAGYLPVGFEFASEVTYPEPEGTTSGILNACVQVFGIVLTLLFEWMLGAAGDRWANLCLCGLLALGTAVTAAIRSDLRRQAAQNKA-