Monarch geneset OGS2.0

DPOGS206127
TranscriptDPOGS206127-TA2055 bp
ProteinDPOGS206127-PA684 aa
Genomic positionDPSCF300028 + 1004887-1021868
RNAseq coverage536x (Rank: top 23%)
Annotation
HeliconiusHMEL0028280.078.64% 
BombyxBGIBMGA000496-TA0.071.94% 
DrosophilaCG2930-PB0.053.27% 
EBI UniRef50UniRef50_B4NDF30.051.45%GK10225 n=4 Tax=Coelomata RepID=B4NDF3_DROWI
NCBI RefSeqXP_002011345.10.053.10%GI16478 [Drosophila mojavensis]
NCBI nr blastpgi|1951338360.053.10%GI16478 [Drosophila mojavensis]
NCBI nr blastxgi|1951338360.053.03%GI16478 [Drosophila mojavensis]
Group
Gene OntologyGO:00160201.2e-92membrane
GO:00068571.2e-92oligopeptide transport
GO:00052151.2e-92transporter activity
KEGG pathway 
InterPro domain[80-609] IPR0001091.2e-92Oligopeptide transporter
[1-370] IPR0161966.1e-20Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL10581 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206127-TA
ATGGCTGACAGGAATACAGGGAAATTACCATATCCGAAGGCAGTGGGTTTCATCGTAACAAATGAGTTCTGCGAGAGATTTTCATATTATGGAATGCGAACTATTCTGTCGCTGTACCTCCGCGACAAGCTCGGCTATGGTGATGACAACGCCACAGTTATTTATCACGTTTTCACAATGTTCGCATATTTCTTTCCCTTGTTGGGTGCCATGATAGCCGACGGATGGCTTGGAAGATTTAGGACGATTTTCTACCTGTCCCTGGTGTATGCGACGGGAAGCGTTCTAATATCCGTGGCAGCAATGCCTCCCGTCCAATTGCCACAATTAGAGTTCACAATCATAGCACTGTTCCTGATAGCCTTTGGCACCGGCGGTATAAAGCCCTGTGTGTCTGCGTTTGGCGGCGATCAGTTCAAATTGCCGGAGCAAGAGCGTTACTTGGGATATTTTTTCTCCCTTTTCTATTTCGCCATAAACGCTGGAAGCTTGATATCGACTTTTCTGACGCCCATTTTAAGAGCAGATGTTCATTGTTTCGGTGAAAATGACTGTTACTCTTTGGCTTTCGGCGTTCCCGGAGTACTCATGATCGTGTCCATAATTTTCTTTGTGGCCGGGAAGAGGTTGTACGTTACGAAAACACCCACAGGAAATGTTCTTGCAAAAGTGTCTAAATGCATCGGGCACGCTATAGTCAAGTCGTGTAAGAGCAAAGACAAGCGAGAGCATTGGCTGGATCGTGCGGACGACAAATTTGACACCAACCTCATAGAAGACATAAAGGCTTTGTTGAGAGTTTTGGTGCTATTCATACCGCTACCTGTGTTCTGGGCCTTATTCGACCAGCAGGGCTCCAGATGGACATTCCAAGCTGACAGAATGGAACAGGATATCGGAAGTTGGACACTAAAGGCTGATCAAATGCAAGTTTTGAATCCACTGCTTATATTGGTCTTCATCCCGCTCTTTGAAGTGGCGATATACCCGTTCTTGACATGGTGCAAGCTCATTAAGAAACCCTTGCATAAAATGATCTGGGGTGGAATCCTGGCCGCGTTTGCGTTCGTTATATCTGGAATTGTTGAGCTGCAACTGCTGCCGACCTACGGTACACCCGTGGCGGAGGGTTTGGCGCAACTGAGATTATACAATGGACAAAATTGTAATTACACATTGAATTTCACTCTTGACAATAAAACTGAGAGCTTTATGATCGGACCGCTGGGATATTACGAGAAACTGGATATAAGTGCACATGATTTTGTTGAACTGCCGTATTCTTTGCGAGGAGAGAGAAACAGTGACTGCGAAAATTCTGAATTCAACGGGACGTTTGATTTAAAAGAAAACATTGCAAACTCCTTCTTTGTACTCAACGAATCCTTGAGAGGTTTTGAAGATAATAATGACAAGGCTATAGACGGAGTCAATATAAGGTTTTTATCTAACGTTATATCACCGGTGTCCATTAATATTGAAAGCAATAAGGGTAATAAGACAAAAATGACGCTGAACAGCTCAAATATGTCGCAGATACATTTGTCTAAAGGGACTTATAATATTTTGGTGGGTGACGTAGAAATTTTGACAGGATATTCATTGAAAGCGGGTGCTGTTTATACTTTTAACTTATATGAAACGAATAACGGCTCTTATCTGATAAACCCTATAATGATAACTCCAGCCAACTCTGTTCACATCCTGTGGCTGGTGCCGCAGTACGTGGTTATGACTATGGGTGAAGTGATGTTCTCAGTGACGGGGCTCGAGTTCTCTTTCACTCAGGCACCCGCTACTATGAAATCCGTTTTGACCTCCGTGTGGCTACTTACAGTTGCTTTCGGCAACCTTATTGTAGTTCTCATAGTAGAGGGGAAATTTTTAGACGCTCAGTGGAAGGAATTCTTCTTGTTCGCTGGACTTATGATGTTGGACATGTTTATATTTACATCTATGGCTTTTAGATATAAATACGTCGAATACAAATCGAGCACAGAAGAATTGGCCGTTGAAGAAATAAAACTACCCGACAAACAGATCAAAGAAACTTAG

Protein sequence:

>DPOGS206127-PA
MADRNTGKLPYPKAVGFIVTNEFCERFSYYGMRTILSLYLRDKLGYGDDNATVIYHVFTMFAYFFPLLGAMIADGWLGRFRTIFYLSLVYATGSVLISVAAMPPVQLPQLEFTIIALFLIAFGTGGIKPCVSAFGGDQFKLPEQERYLGYFFSLFYFAINAGSLISTFLTPILRADVHCFGENDCYSLAFGVPGVLMIVSIIFFVAGKRLYVTKTPTGNVLAKVSKCIGHAIVKSCKSKDKREHWLDRADDKFDTNLIEDIKALLRVLVLFIPLPVFWALFDQQGSRWTFQADRMEQDIGSWTLKADQMQVLNPLLILVFIPLFEVAIYPFLTWCKLIKKPLHKMIWGGILAAFAFVISGIVELQLLPTYGTPVAEGLAQLRLYNGQNCNYTLNFTLDNKTESFMIGPLGYYEKLDISAHDFVELPYSLRGERNSDCENSEFNGTFDLKENIANSFFVLNESLRGFEDNNDKAIDGVNIRFLSNVISPVSINIESNKGNKTKMTLNSSNMSQIHLSKGTYNILVGDVEILTGYSLKAGAVYTFNLYETNNGSYLINPIMITPANSVHILWLVPQYVVMTMGEVMFSVTGLEFSFTQAPATMKSVLTSVWLLTVAFGNLIVVLIVEGKFLDAQWKEFFLFAGLMMLDMFIFTSMAFRYKYVEYKSSTEELAVEEIKLPDKQIKET-