Monarch geneset OGS2.0

DPOGS213633
TranscriptDPOGS213633-TA2649 bp
ProteinDPOGS213633-PA882 aa
Genomic positionDPSCF300165 - 356115-371489
RNAseq coverage733x (Rank: top 18%)
Annotation
HeliconiusHMEL0060540.083.97% 
BombyxBGIBMGA004574-TA1e-15771.04% 
DrosophilaCG10413-PA4e-12857.18% 
EBI UniRef50UniRef50_UPI00022467B50.046.23%UPI00022467B5 related cluster n=1 Tax=unknown RepID=UPI00022467B5
NCBI RefSeqXP_001601569.10.046.05%PREDICTED: similar to cation chloride cotransporter [Nasonia vitripennis]
NCBI nr blastpgi|3454838910.046.23%PREDICTED: solute carrier family 12 member 9-like [Nasonia vitripennis]
NCBI nr blastxgi|3454838910.046.23%PREDICTED: solute carrier family 12 member 9-like [Nasonia vitripennis]
Group
Gene OntologyGO:00160205.3e-49membrane
GO:00068105.3e-49transport
GO:00550855.3e-49transmembrane transport
KEGG pathwayrno:836292e-39 
 K10951 (SLC12A2)maps-> Salivary secretion
    Vibrio cholerae infection
InterPro domain[127-506] IPR0048415.3e-49Amino acid permease domain
[8-60] IPR0011639.9e-13Like-Sm ribonucleoprotein (LSM) domain
[5-61] IPR0109202.2e-12Like-Sm ribonucleoprotein (LSM)-related domain
[7-62] IPR0066492e-11Like-Sm ribonucleoprotein (LSM) domain, eukaryotic/archaea-type
Orthology groupMCL12104 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213633-TA
ATGTCTAAGGCACATCCTCCAGAGTTGAAAAAGTTCATGGACAAGAAGCTGTCCATCAAACTAAACGCTGGTCGTGCTGTGACGGGCGTTCTTAGAGGATTTGATCCTTTCATGAACTTGGTACTTGATGAATCTGTTGAGGAATGTAAGGACGGGCAGAGGAATAATATAGGGATGGTCGTAAGGACAAAGCCATCTATAATGGCGGATACCCTGGACCCATTGTCACCGAGTACAAATGACTCCTCACCAGGCTTAATGCATATATTGAGACAGAGGCTGGGCAGGGCACCAACGGGTTCTAACGGAGATGGATATGTCGAGTTTGGTGAGTTCAGATCAGAACGGCCAGGTGTCAAGAAGGTGGGCACCTTCGCGGGAGTGTTCTGTCCTGTTGTCTTATCAATGTTCAGCGCTCTCGTGTTCATAAGAATGGGATATCTCATTGGTAACGCCGGTCTTCTGGTGACTCTGGCGCAATTCGGCATGGCGTATTTAATAGTTGGCTTCACGGTAACATCGATCTGTGCCATTTCCACGAATGGGGCTGTGGAGGGCGGAGGCGTGTACTTCATGATAAGCAGGACTCTCGGTCCCGAGTTTGGTGGTGCTATCGGAACTCTCTTCTTCTTCGCCAACGTGGTGTCCAGTGCTCTCTGTATATCAGCCTGCGCTGAGGCTATGGTGGAGAATTTTGGGGACGATGGTGGATACCTGATCGGTTCCAGCCCTGGCTTGCCGAGTGGTTACTGGTACAATTTCCTGTACCGCTCGTCGTTGAACGCGGTGGGTCTAGGCGTGTCCCTGGCCGGAGCCTCATTATTCGCGTCAACGAGTCTAGCGATCTGGTTGACGACCATCATTTGTCTGTTCAGCGCCTTCCTCAGCTTCTTCATAACAGCTCCCGGACAGATAGAAAAGCCAGCTTCAAACACTATAGTGAACGCTACCAACTTCACCTACACCGGTCTGAGTTCGGTCACGTTACGAGAGAACCTCTATCCTAACTACTCCCGCGACTACACAGCTGATGGGGAATTCGTGGATTTCGCGTCAGTTTTTGGGGTGCTCTTCACTGGGGTCACCGGGGTCATGGCTGGGGCGAATATGAGTGGCGAATTGAAGAATCCATCTCTGAACATACCCCGCGGTACGTTGGGTGCTCTGCTTCTAACAGCTTTGACGTACCTCTCGTTGTCGCTACTGACTGCTGCGACGTGTTCGAGAGAGTTGCTCCAGAACAACTACGTGTACCTCCTGCCTATCAACATCTGGCCGCCGTTCATAGCGGTGGGTATGCTGACCGCGACCTTCTCAGCTGGGTTGTCAAACCTTATTGGAGCTTCCAGAGTACTCGAAGCGTTGGCCAAAGACGATATATTCGGATTCCTCCTCCGCCCCATGGTGTCGACTTCCGGTAACCCCGTGCTAGCGGTCATCGCCTCCTGGCTGCTGGTACAGTTTGCCATAATGGCAGATTCGCTGAACGCAATCGCTCAGGTCCGTAAGTACCTCCTGCTGCTGGATCCTCGTCGTCAGCACGTGAAGTTCTGGAGACCCCAGATGTTACTCCTGGTCGCGTCCCCGAGACACTGTGCTCCCCTCATCGACTTCGTTAACGATCAGAAGAAGGGCGGTCTGTTCGTGTTGGGTCATGTTCGTGTCGGTGAGCTGGACGGCAGTGGGGATCCTCTGTCTGATGAACACAAGTACTGGCTTCAGCTGATAGACCACCTCAAGGTGAAGGCTTTCGTGGAACTCTGTCTGTGTGAATCTGTGCGAGGGGGGGCCGCCCAGCTATCGCGGCTCTCTGGGCTTGGAGCTATGAAGCCGGATACTGTACTCCTGGGATTCAGGGACCAGGCGCCGCATAGAGATTTCTTCAGGGACCCCTCGTCACCTTACAAGACGGCTATGTTTGACCTGGAGGGCGGGGAGGTGGTGTTCCCCGCCAGATCCTCCAAGATCTCCGTCACGGAGTACGTTAGGATCGTGTCTGATGTCCTCTGTGTCGGGAAGAACGTCTGTCTGTGTCGACATTTCCACAAACTCGATATGGACGCGATCGCCAAACGCTCCTCGTCATCTCGGTCCATCGACGTGTGGCTGGTTGAGCCTCTCCGTCCATCTCGCGAGGAACCGTTCTCTGTCCGAGCTCTGTTCGCGCTGCAGCTAGCGGCCGTCGTTCGCTCAGCCAGGGGCTGGACCCGTCTCGGCCTGAGAGTGCATATCATAACAGGGGTGTCTACTATCGGAACCCTACCCTCGTCCCCTGATCAGTTGTCTCCTGGCCGTCCAATCACCGAGCGTCTCGAACAGCTCCTCAAGATGCTCAGAATCAATGCCACCATACATCCCGTTCCTGAATGGCCTTCATTAGAGGGGTCTCATCGCTGGGCGGACCTCGACGACGATCAGGTCTATCAGAGAGTGCCCCTGAACTATTTACAAAAAGTGAACTCTATAATAAAGGCTCGCAGTTCAGAGGCGGTGGTGACGTTCATCCAGCTCCCCCCTCCCCCGCCCAGCGTCAACAGGGACGACGACATATGTAGTGATTACTTGAAGACTTTAGACGAGCTCACCAAGGACCTGTCGCCCACGATCCTCGTCCGGGGACTGAAATCCGTGACATCAACATCCTTGTAA

Protein sequence:

>DPOGS213633-PA
MSKAHPPELKKFMDKKLSIKLNAGRAVTGVLRGFDPFMNLVLDESVEECKDGQRNNIGMVVRTKPSIMADTLDPLSPSTNDSSPGLMHILRQRLGRAPTGSNGDGYVEFGEFRSERPGVKKVGTFAGVFCPVVLSMFSALVFIRMGYLIGNAGLLVTLAQFGMAYLIVGFTVTSICAISTNGAVEGGGVYFMISRTLGPEFGGAIGTLFFFANVVSSALCISACAEAMVENFGDDGGYLIGSSPGLPSGYWYNFLYRSSLNAVGLGVSLAGASLFASTSLAIWLTTIICLFSAFLSFFITAPGQIEKPASNTIVNATNFTYTGLSSVTLRENLYPNYSRDYTADGEFVDFASVFGVLFTGVTGVMAGANMSGELKNPSLNIPRGTLGALLLTALTYLSLSLLTAATCSRELLQNNYVYLLPINIWPPFIAVGMLTATFSAGLSNLIGASRVLEALAKDDIFGFLLRPMVSTSGNPVLAVIASWLLVQFAIMADSLNAIAQVRKYLLLLDPRRQHVKFWRPQMLLLVASPRHCAPLIDFVNDQKKGGLFVLGHVRVGELDGSGDPLSDEHKYWLQLIDHLKVKAFVELCLCESVRGGAAQLSRLSGLGAMKPDTVLLGFRDQAPHRDFFRDPSSPYKTAMFDLEGGEVVFPARSSKISVTEYVRIVSDVLCVGKNVCLCRHFHKLDMDAIAKRSSSSRSIDVWLVEPLRPSREEPFSVRALFALQLAAVVRSARGWTRLGLRVHIITGVSTIGTLPSSPDQLSPGRPITERLEQLLKMLRINATIHPVPEWPSLEGSHRWADLDDDQVYQRVPLNYLQKVNSIIKARSSEAVVTFIQLPPPPPSVNRDDDICSDYLKTLDELTKDLSPTILVRGLKSVTSTSL-