Monarch geneset OGS2.0

DPOGS211256
TranscriptDPOGS211256-TA1548 bp
ProteinDPOGS211256-PA515 aa
Genomic positionDPSCF300425 + 12987-16596
RNAseq coverage221x (Rank: top 45%)
Annotation
HeliconiusHMEL0126900.084.20% 
BombyxBGIBMGA005433-TA0.084.63% 
DrosophilaCG9706-PB1e-14552.08% 
EBI UniRef50UniRef50_UPI000206355A2e-15556.10%UPI000206355A related cluster n=2 Tax=unknown RepID=UPI000206355A
NCBI RefSeqXP_001661369.13e-17355.39%hypothetical protein AaeL_AAEL002349 [Aedes aegypti]
NCBI nr blastpgi|3504262995e-17255.60%PREDICTED: acetyl-coenzyme A transporter 1-like [Bombus impatiens]
NCBI nr blastxgi|910893017e-17058.42%PREDICTED: similar to CG9706 CG9706-PA [Tribolium castaneum]
Group
KEGG pathwaydre:3940836e-152 
 K03372 (ACATN, SLC33A1)maps-> Glycosphingolipid biosynthesis - ganglio series
InterPro domain[1-500] IPR0161966.8e-18Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL13542 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211256-TA
ATGTCTTTAACTAAAAGAAAGCGTTCTCATGAAAAGGAAGAATTGCTTGAAAATGGTGATCCAGTTTTTAACCAAAGTAATATAAAAGGAGATGAATTAAACATAGCAGTTTTACTTTTTCTGTATACACTTCAAGGCATTCCGTTGGGTCTGGCTGGAGCTGTGCCTATGTTATTACAAAATCGTGGCATTACATACACTCAACAGGCTGAGTTTAGTTTTGTCAATTGGCCATTCAGTGTTAAACTGTTGTGGGCTCCCATAGTGGACGCATTGTTCTGGCCGAAGTTTGGGAGGAGGAAAACATGGCTTGTTCCCATACAATATTTGATTGGTATAGGCATGATTTTGATGTCAACATGTATAACCGACTGGTTGGGGTCAGAAGAAAAGGCACCATCAATGGCAATACTAACTGTAACATTTTTACTACTTAACTTTTTGGCCGCCACTCAAGATATCGCTGTTGACGGTTGGGCTCTGACTATGCTTAAGCGGTGTAATGTTGGTCATGCCTCTACATGTAACACGGTTGGACAAACGGCTGGATTTTTCCTTGGATATGTATTATTCTTAGCTCTGGAATCTCCATATTTTTGCAATAAGTATTTAAGGACGGTACCAGAAGAAACCGGCCTCGTAACTTTAGCTAGCTTTTTGCTATTATGGGGCTGGGTTTTTATTGTTACGACCACTTTTATAGCAATTTTCAAACACGAAGCCAACGACACTGTTACTACAAAGGAAAGTCAAACAAAGGGTATGAAAGATATTGTGAATGCATACAAACAACTGTATACTATTGTGAAATTGCCAGCTGTTAGAACATTGACTTTAGTGTTATTCACGGCTAAGCTCGGTTTCTGTGCAAGTGATGCTGTGTCTGGCTTAAAACTAGTCGAAGCTGGTGTTCCTAGAGAAGATTTGGCGCTCCTAGCAGTACCGCTAGTTCCTGTACAAATTATCATGCCTGTGATACTAGCAAAACACACAACTGGCCCAGCTCCGTTATCCCTCTGGCTTCGGGCGTTCCCACTACGATTGTTGGTTGGACCTCTGGCTGCCATACTTGTAGCCTTAACACCGACACTACTATCAGATAGCGGCCCATCCTATTCATATTTGTTCATTCTTATGTCACTTTATGTATTTCATCAGACCTGCTTGTACTGTATGTTCGTGGCTGTTATGGCGTTCTTCGCTAAGGTCTCCGACCCGTCCGTCGGCGGAACATATATGACCTTATTAAACACTGTGTCAAATCTGGGAACCAATTGGCCTAACACGTTAGCATTATGGGCCATAGACCATCTTACATTTAAATCCTGCAGTGGCTCTACCTTAATTGATAATACTTGTGCATCGCCCTTGGAAACTGAGGAGTGTAAAGCAAACGGCGGCACGTGTAATATAAGAATCGACGGTTTTTATATAGAAGTAGTGATATGTCTTATCGCTGGATTCCTCTGGCTTCAATGGGGAAGAAAAACTATAAGCAGACTCCAGCGATTGCCGTCATCCTCATGGCAAATAAACCGTAACAGATAA

Protein sequence:

>DPOGS211256-PA
MSLTKRKRSHEKEELLENGDPVFNQSNIKGDELNIAVLLFLYTLQGIPLGLAGAVPMLLQNRGITYTQQAEFSFVNWPFSVKLLWAPIVDALFWPKFGRRKTWLVPIQYLIGIGMILMSTCITDWLGSEEKAPSMAILTVTFLLLNFLAATQDIAVDGWALTMLKRCNVGHASTCNTVGQTAGFFLGYVLFLALESPYFCNKYLRTVPEETGLVTLASFLLLWGWVFIVTTTFIAIFKHEANDTVTTKESQTKGMKDIVNAYKQLYTIVKLPAVRTLTLVLFTAKLGFCASDAVSGLKLVEAGVPREDLALLAVPLVPVQIIMPVILAKHTTGPAPLSLWLRAFPLRLLVGPLAAILVALTPTLLSDSGPSYSYLFILMSLYVFHQTCLYCMFVAVMAFFAKVSDPSVGGTYMTLLNTVSNLGTNWPNTLALWAIDHLTFKSCSGSTLIDNTCASPLETEECKANGGTCNIRIDGFYIEVVICLIAGFLWLQWGRKTISRLQRLPSSSWQINRNR-