Monarch geneset OGS2.0

DPOGS202967
Transcript: DPOGS202967-TA (1185 bp)
Protein: DPOGS202967-PA (379 aa)
Genomic position: DPSCF300259 - 330690-334553
RNAseq coverage: 126x (Rank: top 57%)
Annotation
Heliconius: HMEL005440, 2e-156, 76.50%
Bombyx: BGIBMGA011390-TA, 7e-168, 75.20%
Drosophila: hoe1-PC, 1e-124, 56.20%
EBI UniRef50: UniRef50_Q8IGX6, 2e-122, 56.20%, RE09889p n=21 Tax=Arthropoda RepID=Q8IGX6_DROME
NCBI RefSeq: XP_001965130.1, 4e-125, 56.46%, GF21545 [Drosophila ananassae]
NCBI nr blastp: gi|194766035, 8e-124, 56.46%, GF21545 [Drosophila ananassae]
NCBI nr blastx: gi|158300087, 2e-122, 55.15%, AGAP009284-PA [Anopheles gambiae str. PEST]
Group:
Gene Ontology:
  GO:0055085, 1.4e-47, transmembrane transport
  GO:0015746, 1.4e-47, citrate transport
  GO:0016021, 1.4e-47, integral to membrane
  GO:0015137, 1.4e-47, citrate transmembrane transporter activity
KEGG pathway:
InterPro domain: [1-317] IPR004680, 1.4e-47, Divalent ion symporter
Orthology group:

Nucleotide sequence:

>DPOGS202967-TA
ATGGATCTAGATCCGGTTCCTATCTTGATGCTGATGGCAATATATAGTAACATTGGTGGTACTGCGACTCCTGTTGGTGATCCACCCAATGTTATTGTGGCATCTAATAAAGCTGTCGTACAGGCGGGCATAAATTTCACTAATTTCACAGCACATATGTCACTGGGTATCCTAATGGTTTTCATACAGACGTCACTACAGGTGAAATTTATGTTCAGGGACACGAATAGGCTTAGGATAAGAATGCCTAAGGAAATACGAGATCTTCATCGTCTAATTTCAATTTGGCGCCGCGCGGCCGATTCCTTGCCGCATTCGAGCGACGGGGTTAACGCGGTACGAGAGAGACTTGAGAGGAAAATAAAGAAATTGACACAGAATTTAAACAGATTTATGAAAGAGAGCAGCAAACGTGCGTGCTCTAAAGAAACTTTCGTGACCACGTTAGAAGAATTAAAAGCTAAGTATAAAATACGTGACAGAAATCTTTTAATGAAATCGTCCGTGGCAATAGCTTTCGTGGTGATTGTGTTTTTCATGCACTCCATACCAGAACTCAATCGCGTGTCTTTGGGTATGACCGCTCTGCTGGGAGCCATTTTGCTGCTAACCCTGGCTGATAGATCTGATCTAGAGCCAATCCTTCACAGAGTAGAGTGGTCTACATTATTGTTTTTTGCAGCACTTTTTGTTCTTATGGAGGCCCTATCAAAACTTGGTCTTATCTCATTCATCGGAGGACTGTTGGAGCATCTCATCTTCAAAGTGGATGAAAAGTACAGAATGGGTGTATCTTTAATGCTTATATTGTGGGCTTCCGGAGCTATATCAGCTTTCGTGGACAACATTCCTCTAACAACGATGATGATACGAGTTGTTGTGGCCATTGGAAACAACCCCGCCTTGAATCTTCCCATGGGACCGCTTATTTGGGCATTACTCTTTGGAGCCTGTCTCGGTGGTAACGGTACGTTGATTGGTGCTAGCGCCAACGTGGTGTGCGCGGGCGTCGCTGAACAGCACGGCTATCGGTTCACCTTCATGCAATTCTTCAAAATAGGTTTCCCCATCATGATAGGACATCTGTGTGTATCTTCTGTCTACCTTCTGATATGTCACTGTGTTTTTGAATGGCATTGATCTCATATTTTATTGTTTTACAAATGTAATTATTATTTTTTTTGA

Protein sequence:

>DPOGS202967-PA
MDLDPVPILMLMAIYSNIGGTATPVGDPPNVIVASNKAVVQAGINFTNFTAHMSLGILMVFIQTSLQVKFMFRDTNRLRIRMPKEIRDLHRLISIWRRAADSLPHSSDGVNAVRERLERKIKKLTQNLNRFMKESSKRACSKETFVTTLEELKAKYKIRDRNLLMKSSVAIAFVVIVFFMHSIPELNRVSLGMTALLGAILLLTLADRSDLEPILHRVEWSTLLFFAALFVLMEALSKLGLISFIGGLLEHLIFKVDEKYRMGVSLMLILWASGAISAFVDNIPLTTMMIRVVVAIGNNPALNLPMGPLIWALLFGACLGGNGTLIGASANVVCAGVAEQHGYRFTFMQFFKIGFPIMIGHLCVSSVYLLICHCVFEWH-