Monarch geneset OGS2.0

DPOGS206793
TranscriptDPOGS206793-TA2154 bp
ProteinDPOGS206793-PA717 aa
Genomic positionDPSCF300001 - 4687175-4694024
RNAseq coverage92x (Rank: top 62%)
Annotation
HeliconiusHMEL0121902e-18051.98% 
BombyxBGIBMGA000578-TA1e-11372.92% 
DrosophilaSug-PD1e-9342.16% 
EBI UniRef50UniRef50_D6WGI77e-10244.49%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WGI7_TRICA
NCBI RefSeqXP_970503.11e-10145.67%PREDICTED: similar to Sug CG7334-PA [Tribolium castaneum]
NCBI nr blastpgi|2700041163e-10144.49%hypothetical protein TcasGA2_TC003434 [Tribolium castaneum]
NCBI nr blastxgi|1571317419e-10543.29%hypothetical protein AaeL_AAEL012205 [Aedes aegypti]
Group
KEGG pathway 
InterPro domain[1-686] IPR0161963e-51Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL14687 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206793-TA
ATGAATATCAATAAGAAGCTGTTGCCTATGAAAGGACATTTCTTTTTGTTTAATGCTGGTACTGCACCTGTTGTACCATATCTGTCCACATATGCCAGGCAACTAGGTTTTTCATCTGCTACAGTCGGATTAATATATACAGTCCTTCCGATATTTGGTCTGATCGCAAAACCACTTTTCGGTGTTATTGCTGACAGATTTAAAATACAAAAATCAATCTTCATATTATTTCAAGTAGTGACGATTGTAAGTTTTCTTGCCATTTATTTTATCCCAGAAACTGGGATGAAGACATCTGTTGAATTGGACTGTGGGAATGGTGTCACATTTCTGAGAAGTTGCTATGAATCAAATTCACAAGTAGATATGTGTAAGGTTACAGTGTTGGAGAACAAAAATGAGACTGCCGTTTGTAAGATGAAATGTGACATGACTTCACCAAAAATGTGGCAGACTGTGTGTGAACACTGGCATATACCTCAATATTGCTACAGCAATTCTAAAAACATTGAGTACTTAACACACATAAGCAATATTAAAATTAAAGACCAGTGTCTAAACATGGCTTCAAATAATGTCACACTTGATGGTTACACCTTCACCCCACACTGTCGTGTCGGAATGGGTTTTGTTGATATCAATGAGCCTTGTACACTAAGTTGTAAGAACGAGATGTTATCAAAACTCATTGGTGATAACAAACCAAACATGACCTGTGTTGATAATATGATGAATTTTAGACTGTGCACCAATAATAGTTTGCCAGTTAGTGACGAATTTGATGAGTGTAGAGCCTCCTGTGATCTAGATCCATCAACACCATGGAAATTAATGGAAATTTGTGAAGGTTGGGAGGCAGATGTCACAAGCGCCTGCCAGCCAACAAACAGTCACTTCCCAGATACACTTGAGTTCAACGGAACTATACTCCTTTCATCCACAATCACTGAACATCAGTGCGTCTATATACAGCTCAATCATATACAGATGCCTGATGGATCAATACATTACCCCAATTGTATCTACAAGAGTCAATACCAGGTCGAAGCGAACCTTTTCCACGCCTCCTGTGAGATAGAATGTGACAACGAAATGGTGAACGAGTTGTTTTCAACATCAGACGTCAAGCAGAGCGAGTACGGGCTGCAGTTTTGGTTGTTCTTCCTTATGATGATCATAAGCTGGGTCGGTCAGGCTGTTGTGGTGACCTTCGCTGACGCAATCTGCTTTGGATTGCTTGATACTAAGATCTCTCGCTACGGCAAGCAACGCCTCTGGGGTTCCGTTGGCTTCGGTATAATATCCCTAGTCACGGGAACACTAATAGATGTGTTCAGCAGCGGCGCTTACAAAGACTACACCATCGCGTTCGTTCTCATGACCGTATTTATGTTCGGTGATGTCGTTGTGTCCTGCTTCCTACAGGTGGAGTCAACAAAGATGTCGGTGAACATTTTAGCAGACGTGGGTTCATTGTTGACATCGCTGCCAACATTTGTGTTTCTGATATGGACTATATCAGTCGGTCTGTGTACGGGCTTGATATGGAACTTCCTGTTTTGGCATATCGAAGACATATCAGGGTTGAACTGTGAAGTTGAATATGTTAAGACGTTACAGGGCCTGGTTAGTGCCATACAAACATTTGGTGGAGAGATACCATTTATGTTCGTGTCGGGATATCTTCTGAAGAAAATCGGTCACATAAACGCCATGACGCTCGTGCTATTCGCTTTAGGCCTTCGTTTCATCCTATATTCCATACTGACCAACCCGTGGTGGATACTTCCCATAGAGATGTTCCAAGGCATCACATTTGGGATGTTCTATCCCACGATGACGTCATACGCGAACGTTGTATCACCTCCCGGCACAGAAACTACAGTACAAGGTCTAGTAGGAGCAGTCTTCGAGGGTGTTGGTGTATCTCTTGGTAGCTTCATCGGGGGTCGACTCTATGAAATTCATGGGGGTTGGAACACTTTCCAATGGTTTGGAATTTTCGCATTCATAGCATGCGCCATTCATGCTACAGTTCAGTACTTAATGAGGGACAGATACACACACGTCAATCAAGGTTATACGTCAGTTATACGATTCGGTCATTCACCCGATGCTGTATTTATGTTGGAAGATATGGATGAAGGACGTTAA

Protein sequence:

>DPOGS206793-PA
MNINKKLLPMKGHFFLFNAGTAPVVPYLSTYARQLGFSSATVGLIYTVLPIFGLIAKPLFGVIADRFKIQKSIFILFQVVTIVSFLAIYFIPETGMKTSVELDCGNGVTFLRSCYESNSQVDMCKVTVLENKNETAVCKMKCDMTSPKMWQTVCEHWHIPQYCYSNSKNIEYLTHISNIKIKDQCLNMASNNVTLDGYTFTPHCRVGMGFVDINEPCTLSCKNEMLSKLIGDNKPNMTCVDNMMNFRLCTNNSLPVSDEFDECRASCDLDPSTPWKLMEICEGWEADVTSACQPTNSHFPDTLEFNGTILLSSTITEHQCVYIQLNHIQMPDGSIHYPNCIYKSQYQVEANLFHASCEIECDNEMVNELFSTSDVKQSEYGLQFWLFFLMMIISWVGQAVVVTFADAICFGLLDTKISRYGKQRLWGSVGFGIISLVTGTLIDVFSSGAYKDYTIAFVLMTVFMFGDVVVSCFLQVESTKMSVNILADVGSLLTSLPTFVFLIWTISVGLCTGLIWNFLFWHIEDISGLNCEVEYVKTLQGLVSAIQTFGGEIPFMFVSGYLLKKIGHINAMTLVLFALGLRFILYSILTNPWWILPIEMFQGITFGMFYPTMTSYANVVSPPGTETTVQGLVGAVFEGVGVSLGSFIGGRLYEIHGGWNTFQWFGIFAFIACAIHATVQYLMRDRYTHVNQGYTSVIRFGHSPDAVFMLEDMDEGR-