Monarch geneset OGS2.0

DPOGS202161
TranscriptDPOGS202161-TA1626 bp
ProteinDPOGS202161-PA541 aa
Genomic positionDPSCF300162 - 23051-30383
RNAseq coverage14x (Rank: top 82%)
Annotation
HeliconiusHMEL0036930.066.16% 
BombyxBGIBMGA003437-TA2e-17853.60% 
DrosophilaCG42269-PE1e-13244.42% 
EBI UniRef50UniRef50_E2AZ432e-13242.70%Ectonucleotide pyrophosphatase/phosphodiesterase family member 4 n=15 Tax=Pancrustacea RepID=E2AZ43_CAMFO
NCBI RefSeqXP_967323.23e-13844.57%PREDICTED: similar to AGAP006609-PA [Tribolium castaneum]
NCBI nr blastpgi|3454890962e-13743.89%PREDICTED: organic cation transporter protein-like [Nasonia vitripennis]
NCBI nr blastxgi|1582961572e-13745.68%AGAP006609-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00550851e-33transmembrane transport
GO:00160211e-33integral to membrane
GO:00228571e-33transmembrane transporter activity
KEGG pathway 
InterPro domain[1-525] IPR0161968.4e-52Major facilitator superfamily domain, general substrate transporter
[139-523] IPR0058281e-33General substrate transporter
Orthology groupMCL34434 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202161-TA
ATGGATACTTCTAATAAAGCGAATTCTATTGAAATCGAAGATATTAGTACCGCTTTAGAGAAAGTTTTAAGTCACGTGGGGGAGTTGGGGACATATCAAAGGTGGCTATTCCTTGTGATGCTGCCCTTTGGAATGGTGTGGTCTTTCGGTTATTTTGGTCAGATGTTTATCACAGCAACTCCCCAAGAGCATTGGTGCAGGGTGCCAGAATTAGATGGATTGAGTGCCGATCTTAGGAGGTCTCTGGCAACACCGAAGGATTCTAACAATGTGTATGATAATTGTAACATGTTTGTTGCTAATTATACGCTTGTTTTGGAAACGCTGCTCCCGCCGGACCCTGGCACCCCGACTATGCCTTGTGATAACGGCTGGGAGTTCTTATTTGAAGACATACCATATTCTACTATCGTCAATGAGCGTGAATGGGTATGTGATAGATCAAACTTAGTGCCGTGGTCTCAAACAATCGGTTTCCTTGGATCAATCGTGGGTGGAGTTTTGTGTGGAACTTTGGCTGATAGGTATGGAAGATTGCCAGTTTTAATTTTGTCAAATGTATTTGCATTTGTAGGCGGTATTGGCACTATGTTTACTAACGGATTCTGGGACTTCTCTATATGTAGATTCATAGTGGCCATGTCATGTGACAGTTGTTTCATAATGATATATATTTTAGTTTTAGAGTACGTAGGGACAAAGTATCGCTCTCTCATAGGGAATTTGTCAATTGCCATGTACTTTGGTGGTGGATGCCTCTTGATGCCATGGCTCGCTCTCTGGATCGCTGATTGGAAGTACTTTGTTCTGGCTACCTCTTTGCCAGCGTTGCTATCCTTGCTAACACCGTTCTTGGTGCCAGAGAGTGCGCGATGGTTAGTGTCGAAAGGCAGAACCGACGAAGCAGTGAAGGTGCTCAAACGATTCGAAAGGATAAACAAGTCTAAGATACCAGAAGAAGTTCTGGAGGAATTTATTTCTATTGCAGGTAAAACAAAGAATGAGGAGGAAAGCGTTTTGACTATATTCAAGACTCCGTCGCTGCGTGTTACGGTGATTTTTTTGATCATGACTTTCATGGGTGTTGCTGTCGTGTTCGACGGTATCATCCGGCTCTCTGAAAACCTCGGCTTGGACTTTTTCCTGACCTTCACTGTGACCTCAGCTACAGAGATTCCATCGATTCTCATCCTGATCGTCTTATTAGACAGATTAGGTCGGCGGTACATGGTAATGGGCCCTATGCTCATAGCCAGTATATTATCATTAATAGCTGCCTTCGTACCAAGAGGTATAACATCAGTCGCGTTGGCTGTAACGGCTCGCTTCTTCAACAATATGGCGTATAGTACCGTGATCCAGTGGACACCAGAACTTCTGCCAACACCGATGAGAGCGTCTGGAGCATCCTTCGTCCACATCAGCGCCTTCGCAGCAATCACCGTATCTCCATTTTTAATTTACTCTGACCGTGTATGGGAGGGTCTGTCTCTAATTCTGGTTGGTGTGATCGGCGTGTTAGCAGCAGGGGTGGCACTACTGGTCCCGGAGACCAAGGGCCGTAGCATGCCGCAGACTATGGACGACTGGAAGACGATCAATAACGACACCATATTCTCCAGGTGA

Protein sequence:

>DPOGS202161-PA
MDTSNKANSIEIEDISTALEKVLSHVGELGTYQRWLFLVMLPFGMVWSFGYFGQMFITATPQEHWCRVPELDGLSADLRRSLATPKDSNNVYDNCNMFVANYTLVLETLLPPDPGTPTMPCDNGWEFLFEDIPYSTIVNEREWVCDRSNLVPWSQTIGFLGSIVGGVLCGTLADRYGRLPVLILSNVFAFVGGIGTMFTNGFWDFSICRFIVAMSCDSCFIMIYILVLEYVGTKYRSLIGNLSIAMYFGGGCLLMPWLALWIADWKYFVLATSLPALLSLLTPFLVPESARWLVSKGRTDEAVKVLKRFERINKSKIPEEVLEEFISIAGKTKNEEESVLTIFKTPSLRVTVIFLIMTFMGVAVVFDGIIRLSENLGLDFFLTFTVTSATEIPSILILIVLLDRLGRRYMVMGPMLIASILSLIAAFVPRGITSVALAVTARFFNNMAYSTVIQWTPELLPTPMRASGASFVHISAFAAITVSPFLIYSDRVWEGLSLILVGVIGVLAAGVALLVPETKGRSMPQTMDDWKTINNDTIFSR-