Monarch geneset OGS2.0

DPOGS206269
Transcript: DPOGS206269-TA    1449 bp
Protein: DPOGS206269-PA    482 aa
Genomic position: DPSCF300290 - 225188-235078
RNAseq coverage: 154x (Rank: top 53%)
Annotation
Heliconius: HMEL012437    3e-108    45.63%
Bombyx: BGIBMGA010801-TA    0.0    88.89%
Drosophila: CG1607-PA    0.0    73.51%
EBI UniRef50: UniRef50_Q9V9Y0    0.0    73.51%    CG1607, isoform A n=43 Tax=Bilateria RepID=Q9V9Y0_DROME
NCBI RefSeq: XP_395239.3    0.0    73.64%    PREDICTED: similar to CG1607-PA, isoform A [Apis mellifera]
NCBI nr blastp: gi|383851814    0.0    73.68%    PREDICTED: Y+L amino acid transporter 2-like [Megachile rotundata]
NCBI nr blastx: gi|383851814    0.0    73.68%    PREDICTED: Y+L amino acid transporter 2-like [Megachile rotundata]
Group
Gene Ontology: GO:0016020    5.8e-250    membrane
               GO:0003333    5.8e-250    amino acid transmembrane transport
               GO:0015171    5.8e-250    amino acid transmembrane transporter activity
               GO:0006810    1.7e-26     transport
               GO:0055085    1.7e-26     transmembrane transport
KEGG pathway 
InterPro domain: [1-478]     IPR002293    5.8e-250    Amino acid/polyamine transporter I
                 [10-393]    IPR004841    1.7e-26     Amino acid permease domain
Orthology group: MCL12034    Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206269-TA
ATGTCGCTTCTTAATGGCATCACGGTTATTGTGGGCTCAATCATTGGCAGCGGCATTTTTGTTTCACCGACCGGTGTATTAAAATACACAGGCTCTGTTAATGCCAGCCTTATCGTATGGGTGGCGTCTGGGGTTTTCAGTATGGTAGGAGCATATTGTTACGCGGAACTTGGTACTATGATACGAGTCAGTGGAGCCGATTACGCTTATATCATGGAAACTTTCGGACCCTTCGCGGCCTTCATGAGGTTGTGGATTGAATGTATGATAGTTAGACCCTGCTCTATGGCAATCGTGGCCTTGACGTTTAGCACGTACGTGTTAAAGCCAATTTTCCCTGAATGCAGTCCCCCTGAAGACGCGACACGGTTGCTGGCCGCCTGTTGTATTTTATTATTAACGTTCGTCAACTGTTGGTCTATTCGCGCTGCGACACGTGTTCAGGATTGGTTTACTTACGCCAAACTGCTTGCACTCTTCATCATCATAGCTGCGGGTTTATACCAACTGAGCAGAGGGAAGGTGGAACACTTCACATTTGAAGGGACCACGAGTGACGTCACATCCATCGCCCTATCTTTTTATTCTGGATTGTTTGCTTACAATGGATGGAACTATCTTAATTTTATAATCGAGGAATTAAAGGATCCAGTGCGAAACCTCCCTCGAGCCATCGCCATCTCCTGCACCCTCGTCACCATCGTGTACACCTTCACCAACGTAGCCTTCTACACCACGTTGTCACCAACTGAGGTGCTGGGTTCGGCTGCTGTAGCTGTAACATTCTCTGAACGTCTCTTCGGCGCGTTCGCTCTCTCTATACCAATGTTCGTAGCTGCGTCCACTTTCGGCGCTGTTAACGGCGTGCTCTTGACTTCTTCAAGGTTGTTCTACGCTGGTGCTGCCCAAGGTCAAATGCCGGGCATGCTGACGATGGTCTCATCTCGCTCCACTCCCGCACCGGCTGTGATAGCTGTGGCCGTACTATCTCTCATGTATCTCACAGTGTCTGACATTTTCGCGCTAATCAACTACGTCGGATTCGCTACTTGGCTGAGCATAGGAGCGGCAGTTCTCTGTCTACCGGTACTGAGATACACTCAACCGAATTTAGAGAGGCCCATTAAAGTTAACTTGTTCTTCCCAGTGATATATATAATCTGTACGATACTTGTCGTCGCGTTTCCCGCGTGGGCGTCGCCCGCGGAGACAGGGGTCGGCTGTCTCATGATATTAACGGCTGTTCCTGTATACCTGTTGCTGCTGGAACCTAAAACGAGGCTCTCTGGACTTGGATTTATTACTGACGCCGCTACCCGTTTAATACAGAAATTGACGTTATCTGTGCGACCAAAGACAAAGGTATTTTTTTTTATTGACGATGTACGAAGAATAATAATTAATATTTTAAACATAGAATCTTACGAATATATATATATTTTTCAGTAA

Protein sequence:

>DPOGS206269-PA
MSLLNGITVIVGSIIGSGIFVSPTGVLKYTGSVNASLIVWVASGVFSMVGAYCYAELGTMIRVSGADYAYIMETFGPFAAFMRLWIECMIVRPCSMAIVALTFSTYVLKPIFPECSPPEDATRLLAACCILLLTFVNCWSIRAATRVQDWFTYAKLLALFIIIAAGLYQLSRGKVEHFTFEGTTSDVTSIALSFYSGLFAYNGWNYLNFIIEELKDPVRNLPRAIAISCTLVTIVYTFTNVAFYTTLSPTEVLGSAAVAVTFSERLFGAFALSIPMFVAASTFGAVNGVLLTSSRLFYAGAAQGQMPGMLTMVSSRSTPAPAVIAVAVLSLMYLTVSDIFALINYVGFATWLSIGAAVLCLPVLRYTQPNLERPIKVNLFFPVIYIICTILVVAFPAWASPAETGVGCLMILTAVPVYLLLLEPKTRLSGLGFITDAATRLIQKLTLSVRPKTKVFFFIDDVRRIIINILNIESYEYIYIFQ-
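
Length check (illustrative sketch, not part of the original record): the 1449 bp transcript and the 482 aa protein are consistent if the transcript is a complete coding sequence with one stop codon, since 3 x (482 + 1) = 1449. The Python sketch below shows one way to check this for any transcript/protein pair in FASTA format. The function names parse_fasta and lengths_consistent are hypothetical, and treating a trailing "-" or "*" as a stop marker is an assumption based on how the protein sequence above ends.

```python
def parse_fasta(text):
    """Return {header: sequence} for a FASTA-formatted string."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            # Sequence lines may be wrapped; concatenate them.
            records[header] += line
    return records


def lengths_consistent(cds, protein):
    """True if len(cds) equals 3 * (residues + 1 stop codon).

    Assumption: a trailing '-' or '*' in the protein marks the stop
    codon and is not counted as a residue.
    """
    residues = protein.rstrip("-*")
    return len(cds) == 3 * (len(residues) + 1)


# Toy records (hypothetical IDs): 9 nt encode 2 residues plus a stop.
toy = ">toy-TA\nATGTCGTAA\n>toy-PA\nMS-\n"
recs = parse_fasta(toy)
print(lengths_consistent(recs["toy-TA"], recs["toy-PA"]))
```

To apply it to this record, paste the two FASTA entries above into one string and call lengths_consistent on the DPOGS206269-TA and DPOGS206269-PA sequences.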