Monarch geneset OGS2.0

DPOGS212614
TranscriptDPOGS212614-TA1569 bp
ProteinDPOGS212614-PA522 aa
Genomic positionDPSCF300245 + 137215-140160
RNAseq coverage38x (Rank: top 73%)
Annotation
HeliconiusHMEL0024734e-16654.35% 
BombyxBGIBMGA009055-TA9e-11753.66% 
DrosophilaCG7442-PA5e-6830.02% 
EBI UniRef50UniRef50_D2CG571e-7534.46%Putative uncharacterized protein GLEAN_10997 n=1 Tax=Tribolium castaneum RepID=D2CG57_TRICA
NCBI RefSeqXP_391853.26e-7932.32%PREDICTED: similar to CG7442-PA [Apis mellifera]
NCBI nr blastpgi|3838630031e-8132.70%PREDICTED: solute carrier family 22 member 21-like [Megachile rotundata]
NCBI nr blastxgi|3838630032e-8332.95%PREDICTED: solute carrier family 22 member 21-like [Megachile rotundata]
Group
Gene OntologyGO:00550855.4e-28transmembrane transport
GO:00160215.4e-28integral to membrane
GO:00228575.4e-28transmembrane transporter activity
KEGG pathway 
InterPro domain[130-518] IPR0161962.6e-45Major facilitator superfamily domain, general substrate transporter
[138-506] IPR0058285.4e-28General substrate transporter
Orthology groupMCL21046 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212614-TA
ATGGTCCTTAGTCGTGAGGCTGCTGGTCCCCAAACGGCCAGTGTTACGGATATTTTGGTCAAGTTTGGAAGATACCAAATATTTCAATACATTCTGGTCAGTTTACCGATACTTTTTCTATCGATAAATAATGTTAATTATATATTCGTGGCTGGTGATGTCCAATACAGGTGTCGGATAACGGAATGCGATTCCTCAAATGCAACATACTCCGTCCCCTGGTGGCCGAGCAACATCGACCCATGCACCAAACCTATTTTAAACCACACTTATCTAAACAGTGGTGTCTGCACTAACCGCAGTTTTACGGGGGCTGTAGAAAAGTGCACGGATTGGATTTATGAGAACCATGACACTATAATTCCTGAGTTCAACTTAGCATGTAAACCTTGGATGAAAAACTTGGTGGGATTAGTTCACAGCGCTGGTATGGCCTTGGCGATGCTTATTGGTGGCGTAATGGCTGACCATTTGGGTCGCAGAACTACTTTGATTATTTGCGGTCTAGGTTGTTTTGTAGGCAACTTCAAGACCCTAGCGTCTAACTATAACTTATATATATTTATAGAGTTTTTAGAAGCTGTTATTTCCGGAGGAGTGTACACATCGGGGGCTGTTTTAATGGTTGAAATTGCGGGTTTAAATAAAAGAGTCCTTGCTGGAGTTTTATTCTCATATTCAATATATTTGGGAGAAGCAATTTTCGCATTAGTCGGAATGGTAGTGCCGTATTGGAAAACACTAATCTATATAATTTGTTCTCCGCCTATAATATTTCTTTCCTATTTTGTACTGGTAAATGAAAGCCCTCGATGGCTTATCCTAAAAGGAAAAAGTGATCAGGCAAAAGAAGTTTTGAAATCAATGGCAAAAACTAATAAGATTAATATTGATTTTGAAGATTTATATAACATGGACGAAGAAAAACTCAAATCTATGTTCAACCTCGGAGGACCAGTAAAGAAACAAACACTTATTGAGGCGTTCAAGAGCAGAGAAATTGTGAAGCGATTTTTAGTTGCAGCTTACTGCAGATTTTCTTGTAGTTTTGTTTATTACGGGCTATTGATTAATTCTGTGGTACTGCCTGGAGACAAATACACAAATTTCTTTTTGGCAGCTATGATGTCATTTCCTGGTGAATTAATAAGCATGTTCTTCATGAACAAAATTGGTAGAAAGTTACCATTGATAACTGGATTTCTATTGGGTGGGATGTCATGTATGGTATCTGGATATGCCACACACACATGGTTCAAAATTTTTCTGTTTCTTTTGGGGAAGCTGGTAGCATCGTCGTGCTATACAGGAGTCGTTACTTACACGGTTGAGTTGTTTCCAACAAGCGTTCGTGGATCGCTGATTGGGTTCTGCGCTCTGGCATCAGCATTCGGAAACATGTTGTCTCCCCTGACTCCTGCTTTGTCTCCTGCCACTGCTTCCATTCTTTTCGGATGCGCGGCGTTCTTAGCAAGCAGTCTTCTGTGCCTCACTCCTGAAACCAAAGACACTCCTCTCCTGGACACTATTGAACAAATTGAAAGCAAGAAACAGAGGAATGATAAATAA

Protein sequence:

>DPOGS212614-PA
MVLSREAAGPQTASVTDILVKFGRYQIFQYILVSLPILFLSINNVNYIFVAGDVQYRCRITECDSSNATYSVPWWPSNIDPCTKPILNHTYLNSGVCTNRSFTGAVEKCTDWIYENHDTIIPEFNLACKPWMKNLVGLVHSAGMALAMLIGGVMADHLGRRTTLIICGLGCFVGNFKTLASNYNLYIFIEFLEAVISGGVYTSGAVLMVEIAGLNKRVLAGVLFSYSIYLGEAIFALVGMVVPYWKTLIYIICSPPIIFLSYFVLVNESPRWLILKGKSDQAKEVLKSMAKTNKINIDFEDLYNMDEEKLKSMFNLGGPVKKQTLIEAFKSREIVKRFLVAAYCRFSCSFVYYGLLINSVVLPGDKYTNFFLAAMMSFPGELISMFFMNKIGRKLPLITGFLLGGMSCMVSGYATHTWFKIFLFLLGKLVASSCYTGVVTYTVELFPTSVRGSLIGFCALASAFGNMLSPLTPALSPATASILFGCAAFLASSLLCLTPETKDTPLLDTIEQIESKKQRNDK-