Monarch geneset OGS2.0

DPOGS203603
TranscriptDPOGS203603-TA1290 bp
ProteinDPOGS203603-PA429 aa
Genomic positionDPSCF300063 - 232027-241174
RNAseq coverage186x (Rank: top 49%)
Annotation
HeliconiusHMEL0173203e-11253.16% 
BombyxBGIBMGA007284-TA2e-5956.45% 
DrosophilaIndy-PB2e-0918.33% 
EBI UniRef50UniRef50_B0WCP24e-1122.02%I'm not dead yet n=2 Tax=Culicidae RepID=B0WCP2_CULQU
NCBI RefSeqXP_001605390.14e-1420.77%PREDICTED: similar to sodium/dicarboxylate cotransporter, putative [Nasonia vitripennis]
NCBI nr blastpgi|3454845256e-1320.77%PREDICTED: protein I'm not dead yet-like [Nasonia vitripennis]
NCBI nr blastxgi|1700372581e-1422.07%I'm not dead yet [Culex quinquefasciatus]
Group
Gene OntologyGO:00160202.9e-10membrane
GO:00550852.9e-10transmembrane transport
GO:00068142.9e-10sodium ion transport
GO:00052152.9e-10transporter activity
KEGG pathway 
InterPro domain[197-412] IPR0018982.9e-10Sodium/sulphate symporter
Orthology groupMCL25331 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203603-TA
ATGGCAGCTCCTGCTGCCGGTTTCATACCTTTATTTATGTTACCCATGTTCGGAGTACTTGATACAGTGAAAGTGGCAGGAAGCTATTTTAATGATAGTATTTGTTTATTTGTTGTTGCCGGAATGCTGCAAATGTTATTGAATTCCTCCGGCTTTGATCGTCGAATTATATTATGGCTGTTATGTTCTGGAGACAACTGTCAATTTGCAGGAAAAAGACTCATATTTAAAGCTTGCACGGCGGCATTTTTTCTTTCCATGTTTTCTAACAGACTAATTACGTCATCAACTATAATTCAATTTTTGACGCCAGCTTTAACGAATTTGCAGTCTTTGACTTCAAAAAATAGAAAAGAAGAGCCAAACTACGATGAAATGAGATTTATTATCCTAAACGCTGTTCAAACTTCTGCATCAATAGGAAGCATCACCATCATCCATGCAGCTTTGACCCCTATAGCTATGAGAGCCATTTGGTCTTCTGTAATTGGTAAAGCGATGAGTGCAAACAGCATGACAGAAATGAGAAATTTTCTAGTAAAACAACAAAATGCCTTGCCACCCAGAAGCTGTTTTGAAATGGAAACTGTATTTTTTTACCTATTTTTTTTGGTGGTATGTTTATTTCGATGGAGTGAATGGTTAAACATCGGTTGGGCGACTTACAATAGCGACATGCCTGGAGCACCAAAGATCAAAGATGCTACCGTCGCTTGTTTGTTTTTGGTGGCCCTTAGCATTTTGCCGCGATCGCATAACATTTTAAAATACATAAATGCTCAAAAAAAATCTGACTTAGGTAATGTAAAGCCCGAGTCTGCTATTTTGTGGTGGAAGTTTGTCGACAAAAATACATACTATGGCTACATTATACTATTGGGTGGTGGTGTGGCATTAAACACAGCAATTAAAACGACCGATCTTACAAAAGCAGTGACTTCGGACTACGGCACATTTATAACAAATAAATCATGGAACACTTCAATTTTTATTGTCTGCTTGATATCAGTTTTACTAGCGAACATTATGACTGGCATAGCTGCTTGTTGCACATTTTTGCCGTTTGTATTAAATATGGCAGGAGAAGGTGTATCAGAAGTATGGAAAAAACGATTGTATCTGGGGACATTAGGGGTTGGCATTGGAACATCATTTGGTTTTGTTAGCCCATTCTATTTCACTCCCGCATATTTTTGCCATCATACAGGGAAGGTACCTCTGAAGAAAATGGTAAATACATACTATAACATAACTTACATATTGGCACACAAGAATTTTAAAACAAAATAA

Protein sequence:

>DPOGS203603-PA
MAAPAAGFIPLFMLPMFGVLDTVKVAGSYFNDSICLFVVAGMLQMLLNSSGFDRRIILWLLCSGDNCQFAGKRLIFKACTAAFFLSMFSNRLITSSTIIQFLTPALTNLQSLTSKNRKEEPNYDEMRFIILNAVQTSASIGSITIIHAALTPIAMRAIWSSVIGKAMSANSMTEMRNFLVKQQNALPPRSCFEMETVFFYLFFLVVCLFRWSEWLNIGWATYNSDMPGAPKIKDATVACLFLVALSILPRSHNILKYINAQKKSDLGNVKPESAILWWKFVDKNTYYGYIILLGGGVALNTAIKTTDLTKAVTSDYGTFITNKSWNTSIFIVCLISVLLANIMTGIAACCTFLPFVLNMAGEGVSEVWKKRLYLGTLGVGIGTSFGFVSPFYFTPAYFCHHTGKVPLKKMVNTYYNITYILAHKNFKTK-