Monarch geneset OGS2.0

DPOGS202940
Transcript: DPOGS202940-TA, 1344 bp
Protein: DPOGS202940-PA, 447 aa
Genomic position: DPSCF300220 + 235229-243544
RNAseq coverage: 51x (Rank: top 70%)
Annotation
Heliconius: HMEL014216  3e-96  70.39%
Bombyx: BGIBMGA001908-TA  3e-91  75.00%
Drosophila: CG15890-PA  1e-82  35.57%
EBI UniRef50: UniRef50_B3N1F7  6e-82  34.98%  GF21601 n=4 Tax=Endopterygota RepID=B3N1F7_DROAN
NCBI RefSeq: XP_001659917.1  3e-84  36.34%  adenylate cyclase [Aedes aegypti]
NCBI nr blastp: gi|157122007  5e-83  36.34%  adenylate cyclase [Aedes aegypti]
NCBI nr blastx: gi|328711223  4e-84  35.62%  PREDICTED: proton-coupled folate transporter-like isoform 1 [Acyrthosiphon pisum]
Group
Gene Ontology: GO:0055085  2.9e-14  transmembrane transport
               GO:0016021  2.9e-14  integral to membrane
KEGG pathway 
InterPro domain: [41-437] IPR016196  9.3e-21  Major facilitator superfamily domain, general substrate transporter
                 [42-338] IPR011701  2.9e-14  Major facilitator superfamily
Orthology group: MCL20686  Lepidoptera specific

Nucleotide sequence:

>DPOGS202940-TA
ATGCATTTGGAAAAATCGTGCCGCATAAATTTGAAATTAGGTGATGAAGTATGCGATCGCATCAAGAACAGAAACACAACAGGATTGGAATTGGAATTAAATCAGGTGCAAACATTAGTAGCTCAGGTAGTAGCTTGGAAGTTCCCCCTTCAAACCGCAATACCAGCTGTAATGGTCCTGTTTGTTGGTGCATGGAGTGATAAGACGAAGAAACGAAAAATTTGTATCCTGTTTCCATTCCTTGGAGAGATAATTGCGAATATCGGACTTATATTTGCTACCTTTTTTAAATTATCTCTAACAGTGACTGCCCTTATAGAAGCATTACCATCAGCGTTCACTGGTAGCTATATTATCATATTTATAGGAATGTACAGTTTTATGGCTGATCGGACCTCAGTGGAAAGCAGGACGTTTCGTCTAGGACTTGTAACTATATGTGTTACTTTAGGCACACCAACTGGGACTGCCCTGAGTGGGATACTCTTACAGGTTTTAGGTTACTACGGAATGTTCTCAATGCTAATAATACTTTATTCTTCATCACTTCTGTATGGTTTTATACGACTTGAAGATATTTTACCAATCGAGGATAACGTAACGTCAGACTGTGGGAATGGTAGTTTCAAGAGATCTGTTAAGGAGGTGTTTGGTCTGGTTGCCAACACTGTGTTAGTGGTTTTCAAACCTCGTGCTTTTGGGATGAGAAAAAGAATATTGGCAGTTATCGTCCTTTACGTTATTATGGTGGGACCGTTGTATGGTGATTCTCAAGTCGGTTATTTGTATGCGATACATAAATTTAAATTTCATGAAATCGAGTACACATTGTATGGAACTATAAATATCATTTTTGGAATGTCTGGAACATTCTTTTGTATAACCCTTTTATGCAAGAGGTTGCATGTGCAGGACAGCCTTATCGGTCTCCTCGCAGGAGTAAGCAGGATAGCTGGTTGTTTGGTGTTAGCCTTCGCTCCGAACAGAGCCTGGTACTATTCGGCACCTATTTTCAGCATATTCAGTCACACTGGTCTCACCGCGGTGAGATCCATAGCTACCAAAAGTGTTCCGGGTGACGAAGTCGCCAAATTAAGTTCAGTGATTGGTGTAATGGAAGCAATAGCGCCTTCAGTGTACATGCCGACATCGAGTTTTATATACGTCAATACCTTGGACACGTTTCCTGGCGCCTTTTATTTGTTCGACGCCATGCTCACTGTATTCGCCTTAATATTATTTTCATATATTTACGTTTTGGTGAGACGAATTGAGAGGGACATGGTTAGAGATCATAGCAGGAAAGAGGAATTCGCTAGAACCAATGAAGTATCTAGATTTTAG
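
The transcript above can be checked against the protein record by translating it codon by codon. The following is a minimal sketch and not part of the gene record: the helper name `translate` is hypothetical, the standard genetic code is assumed, and only the first 30 and last 9 nucleotides of DPOGS202940-TA are used for brevity.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence; unknown codons become 'X', stops become '*'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 and last 9 nucleotides of the DPOGS202940-TA record.
prefix = "ATGCATTTGGAAAAATCGTGCCGCATAAAT"
suffix = "AGATTTTAG"

print(translate(prefix))
print(translate(suffix))
```

The translated prefix can be compared with the start of DPOGS202940-PA. This sketch writes the terminal stop codon as `*`, whereas the protein record appears to mark it with a trailing `-`.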

Protein sequence:

>DPOGS202940-PA
MHLEKSCRINLKLGDEVCDRIKNRNTTGLELELNQVQTLVAQVVAWKFPLQTAIPAVMVLFVGAWSDKTKKRKICILFPFLGEIIANIGLIFATFFKLSLTVTALIEALPSAFTGSYIIIFIGMYSFMADRTSVESRTFRLGLVTICVTLGTPTGTALSGILLQVLGYYGMFSMLIILYSSSLLYGFIRLEDILPIEDNVTSDCGNGSFKRSVKEVFGLVANTVLVVFKPRAFGMRKRILAVIVLYVIMVGPLYGDSQVGYLYAIHKFKFHEIEYTLYGTINIIFGMSGTFFCITLLCKRLHVQDSLIGLLAGVSRIAGCLVLAFAPNRAWYYSAPIFSIFSHTGLTAVRSIATKSVPGDEVAKLSSVIGVMEAIAPSVYMPTSSFIYVNTLDTFPGAFYLFDAMLTVFALILFSYIYVLVRRIERDMVRDHSRKEEFARTNEVSRF-
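
The lengths reported in the record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function name `length_summary` is hypothetical, the genomic coordinates are assumed to be 1-based and inclusive, and the transcript is assumed to end in a single stop codon.

```python
def length_summary(transcript_bp: int, protein_aa: int, start: int, end: int) -> dict:
    """Relate transcript, protein, and genomic lengths for one gene record."""
    codons = transcript_bp // 3
    genomic_span = end - start + 1  # assumes 1-based, inclusive coordinates
    return {
        "codons": codons,
        "in_frame": transcript_bp % 3 == 0,
        # One codon is assumed to be the stop, so it encodes no residue.
        "matches_protein": codons - 1 == protein_aa,
        "genomic_span_bp": genomic_span,
        # Genomic bases not present in the transcript (presumably intronic).
        "non_transcript_bp": genomic_span - transcript_bp,
    }


# Values taken from the DPOGS202940 record.
summary = length_summary(1344, 447, 235229, 243544)
for key, value in summary.items():
    print(f"{key}: {value}")
```

Under these assumptions, 1344 bp corresponds to 448 codons, that is 447 residues plus a stop, which agrees with the reported protein length.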