Monarch geneset OGS2.0

DPOGS210111
TranscriptDPOGS210111-TA2838 bp
ProteinDPOGS210111-PA945 aa
Genomic positionDPSCF300017 + 1278571-1285766
RNAseq coverage336x (Rank: top 34%)
Annotation
HeliconiusHMEL0050610.070.51% 
BombyxBGIBMGA000220-TA0.071.59% 
DrosophilaCG31121-PB0.041.52% 
EBI UniRef50UniRef50_D6X0C60.051.84%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X0C6_TRICA
NCBI RefSeqXP_968472.10.051.84%PREDICTED: similar to GA16025-PA [Tribolium castaneum]
NCBI nr blastpgi|910910960.051.84%PREDICTED: similar to GA16025-PA [Tribolium castaneum]
NCBI nr blastxgi|910910960.052.58%PREDICTED: similar to GA16025-PA [Tribolium castaneum]
Group
Gene OntologyGO:00160209e-09membrane
GO:00055242e-05ATP binding
GO:00168872e-05ATPase activity
KEGG pathwaycfa:4745719e-45 
 K05684 (ABCG8)maps-> ABC transporters
InterPro domain[702-833] IPR0135259e-09ABC-2 type transporter
Orthology groupMCL15682 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210111-TA
ATGATGCAGGATCCAAGGGGAATGCATTCAGAAGATCTACACGCCTGGTCCATATACAGGCAAAACTTAAACTCCGATTTCACTGACAGTGCCTTAGGCAGTACCGATAAAAGTCCTCTACCGTACGGAAATTTCCAACTTCGCGATACTACAGTACAGTCTATCCTTTCACATCCCAGATATGGACCTAAGTCGGCTCTTGGATCTAATATGTATACATATCTTAAGTTCGGATTGCCACGTGTTTTTCCACCGAATCACAATGGATCACAACGATCAGGGACACCGAAGCCCAAAACATCGATTGGAAGCGGAATGAAATCCGTTCCAAGAATTCAAAGACAACCTCGTGGTGCCAGAGAAAACTCCTCCGGATACGATAGCTCTGATAACGAGACGTCAACCAATTATAAATACAGTCGCAAGTATCGATCGGACCCCGATTTTAGAATGCAAAACCTACATCCATCTTATCAGCATGCGACTGGAGTTCCCCTAGTGGCGATGCAGCAAGCAGGAATACGTGGCGAGTCTCATTGGACCAGCACAAGAAACAAGTCAGTGAGTGAGGCCAACCTGCTGGGGATAGACTCGAGACCCACTCGGTCCTACGGCCATCGGAGAAGTAGTGTCGTGGATTACGTGCCCGACAATGACCACCAAAGTTATCTCATGCCGTCCTCTCACCTCGGAGGTCGAATGTCCAAAGCAGGGAGTCATATGTCCTTAGCTCACTCCAGGAAACATTCAACTCTCCGACCTGGTGATCATTTAGATGGATACACGCATTCTGCATACATCTATCCTAATTACTACGTAAATAACTTAGAAATAACGTCGCCGGAAAATAAATCGACGCTGCTTGTGTCGGGGCTCAGTTTCGAAGTGAAATCTGGCGAAATATTAGCAGTGCTGGCGACATCACACCAGGAAGCGACTGGCTTACTGGACGTACTGGCTGGAGTCAGGAAAAAGCTGTCGGGCGATATAATCCTGAACGGTCAGCCGGTGGCGTCGTCGACTCTCCGTAAGGTGAGCGCGTACGTTCGCTGCGACACGTGTCTGTGCGGGGCCATGTCCGTGGAACACACGCTGAGGTTCCACGCCACGCTACGAGCTCCCCGACACCGACACGCCAAGATGGACGACAGGGAGCGGATAAACCTACTTATCGAGGAGTTAGGTCTGGAACAAGTAAGAGACACTAACGTAGAGCGACTGACTCGTTCCGAGATCAGGCGGTTGAATGTGGCGTGTCAGCTGCTGTTAGACACGGCGGTCCTCATACTCGACCAGCCCACCAAGGAGATGGACATCTTCGACACTTTCTTCCTGGTGGAGTACTTGAGGAACTGGGCGAGCACCGGACGTGTCGTCATAATGTCTCTACATCCACCCACTTACGAAATATTTGCTATGTTAACTAAGGTGGTGCTCATCTCCGCGGGTCGGACGATGTTCAGCGGCTATAGAAGGGATATGTTGCCATATTTCGCCTCCATAGATTATCCGTGTCCCGCCTATAAGAACCCTTCTGATTACTATCTGGACCTGGTGACCCTGGACGACCTGTCGGCGGCGGCCATGTTGGAGTCGTCAGGTCGTATCGAGTCGCTGGCCGGAGTGTTCTTGGGCGCGCACTCCGCCCCCGAGCCCCCGCCCCCCGTCCCCCTGCCGGCTCCTGTCCGCCGAGCCAACGTCCTCGTCCAGGTGTTCGCTATGCTGGAAAAATCGTTGCTGTACACTCAAATGACGACGTTGTCAAACGTAATTACAAGAATTCTTATAGCAGCCATCATGTCGATCGTCACGGGCGCCGTGTTCTGGGACCTGCCCTCCACCGACCCCAAGTTAACGCTCAACGACCGCGTGGGGTTCCACTACTCGGTGATGTGTGTGTCGCTGTTCCCGTGCCTGGTGTGGTCGTGTCGCGAGGCGGCCGCGGCCCGGCCTCACGTGGAGAGAGACATCGCCGTGGGGCTTTACTCACGGACGCTGTTCATACTGTTCGATTTAACGCTCAACGACCGCGTGGGGTTCCACTACTCGGTGATGTGTGTGTCGCTGTTCCCGTGCCTGGTGTGGTCGTGTCGCGAGGCGGCCGCGGCCCGGCCTCACGTGGAGAGAGACATCGCCGTGGGTCTTTACTCACGGACGCTGTTCATACTGTTCGATCAATTCATGGAGTTGTGGTCGGCTACGTTGACGTGGTTGGCGTATTTAGTCCCGAGTTACGCTATGAGCGGTCTGTACGCGCAGACCGCGGGCTCCTTCGACGGGTTCTACATTTATTTAGGTTATATGTTGTTGTACCTAATAAGTACTCAGATGTTGTGTCGCGCGGCGGTGTTTGTCGTGCCGAAGGAGAAGTCTTCCGCCGCGTTGGCTTGTTTCTGTTTGTTCCTAACAACTCTCGTGAACGGCGTGACGCTGCACCAGCTCGACTTACCCTTTTACGTCAAATGGTTGGAATACGTGTCGCCTTCGAAATGGACAATACCGGAGATATTGAGGCGGGAACTGAGTGACGTCGCGCTGAGATCTAGTATAAGCAAGGATTTGAGATGTACAAATAAACAGCGACAGCATCTGGAGATCATAGTCCAGTCGTCGTGTCCGCTGCCCAATGGCACCCAGGTGCTATCGAACTTTGACTTCCTCCGCTCAGATCACATCTGGGAGTGGACCGAGGACAGTTTCCTCGTGGCTCTGTCTATTTTCTATGCGGTTTTCGCTCTAGTCGCCATATTTGCGTTTGTTTTCGATTGTACCGACTACGTCAGGAGTAAGGAACGAGCGTCGCGGAAAGGTTACAAAGTGACCGCCAACACGCCCTAG

Protein sequence:

>DPOGS210111-PA
MMQDPRGMHSEDLHAWSIYRQNLNSDFTDSALGSTDKSPLPYGNFQLRDTTVQSILSHPRYGPKSALGSNMYTYLKFGLPRVFPPNHNGSQRSGTPKPKTSIGSGMKSVPRIQRQPRGARENSSGYDSSDNETSTNYKYSRKYRSDPDFRMQNLHPSYQHATGVPLVAMQQAGIRGESHWTSTRNKSVSEANLLGIDSRPTRSYGHRRSSVVDYVPDNDHQSYLMPSSHLGGRMSKAGSHMSLAHSRKHSTLRPGDHLDGYTHSAYIYPNYYVNNLEITSPENKSTLLVSGLSFEVKSGEILAVLATSHQEATGLLDVLAGVRKKLSGDIILNGQPVASSTLRKVSAYVRCDTCLCGAMSVEHTLRFHATLRAPRHRHAKMDDRERINLLIEELGLEQVRDTNVERLTRSEIRRLNVACQLLLDTAVLILDQPTKEMDIFDTFFLVEYLRNWASTGRVVIMSLHPPTYEIFAMLTKVVLISAGRTMFSGYRRDMLPYFASIDYPCPAYKNPSDYYLDLVTLDDLSAAAMLESSGRIESLAGVFLGAHSAPEPPPPVPLPAPVRRANVLVQVFAMLEKSLLYTQMTTLSNVITRILIAAIMSIVTGAVFWDLPSTDPKLTLNDRVGFHYSVMCVSLFPCLVWSCREAAAARPHVERDIAVGLYSRTLFILFDLTLNDRVGFHYSVMCVSLFPCLVWSCREAAAARPHVERDIAVGLYSRTLFILFDQFMELWSATLTWLAYLVPSYAMSGLYAQTAGSFDGFYIYLGYMLLYLISTQMLCRAAVFVVPKEKSSAALACFCLFLTTLVNGVTLHQLDLPFYVKWLEYVSPSKWTIPEILRRELSDVALRSSISKDLRCTNKQRQHLEIIVQSSCPLPNGTQVLSNFDFLRSDHIWEWTEDSFLVALSIFYAVFALVAIFAFVFDCTDYVRSKERASRKGYKVTANTP-