Monarch geneset OGS2.0

DPOGS212638
TranscriptDPOGS212638-TA2025 bp
ProteinDPOGS212638-PA674 aa
Genomic positionDPSCF300245 + 477442-501425
RNAseq coverage582x (Rank: top 22%)
Annotation
HeliconiusHMEL0145740.086.43% 
BombyxBGIBMGA005202-TA0.082.55% 
DrosophilaCG3164-PC0.072.98% 
EBI UniRef50UniRef50_E0W3T60.075.66%ABC transporter, putative n=7 Tax=cellular organisms RepID=E0W3T6_PEDHC
NCBI RefSeqXP_397486.20.076.42%PREDICTED: similar to CG3164-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|3263711470.086.73%ATP-binding cassette transporter subfamily G [Bombyx mori]
NCBI nr blastxgi|3263711470.086.73%ATP-binding cassette transporter subfamily G [Bombyx mori]
Group
Gene OntologyGO:00055240ATP binding
GO:00160210integral to membrane
GO:00160201.9e-39membrane
GO:00168875e-21ATPase activity
GO:00001661.4e-12nucleotide binding
GO:00171111.4e-12nucleoside-triphosphatase activity
KEGG pathwaycin:1001791079e-179 
 K05680 (ABCG4)maps-> ABC transporters
InterPro domain[13-674] IPR0200640ABC transporter, G1-like
[402-606] IPR0135251.9e-39ABC-2 type transporter
[77-199] IPR0034395e-21ABC transporter-like
[62-255] IPR0035931.4e-12ATPase, AAA+ type, core
Orthology groupMCL10154 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212638-TA
ATGATATCCGACAGGAAGGGCAGTCTGAAGGTGACGATCACTCCCGCCCAGCACAAGACCCTAACTCACCTTCCGAAAAGGCCGCCAGTGGATCTGGCTTTCACAGATCTCACATACAAAGTACAGGAAGGCCGGAAAAGCAATGTGAAAACTATCCTCAAGTCAGTCTCGGGGAGGCTTCGCTCCGGCGAGCTGACGGCCATCATGGGTCCATCAGGCGCTGGAAAATCAACGCTGCTCAATATATTGACTGGTTACAAGACTTCTGGGATGGAGGGTAGTATCACCGTGAACGGGATGGAGCGTAATCTGTCCAGCTTCCGCAAACTGTCATGCTACATCATGCAGGACAATCAGTTGCATGGAAACCTTACCGTAGAAGAGGCCATGTCTGTTGCGACGGCTCTCAAGCTGCCTAGCGCTACCACCAGGGACGACAAAGAGGAAGTGATACAAGAGATTCTTGAAACCCTCGGTCTGTCAGAGCATCACAAAACGATGACGTCAAACTTGTCCGGGGGACAGAAGAAGCGGTTGTCGATAGCTTTAGAGCTGGTCAACAATCCGCCGATCATGTTCTTCGATGAGCCGACATCCGGTTTGGACAGTTCCTCCTGTTTCCAATGTATATCACTGTTGAAGACCCTGGCCAGGGGTGGCAGGACCATCATCTGCACCATCCACCAGCCTTCCGCGAGATTATTCGAAATGTTCGACCATTTGTACACGTTGGCGGACGGGCAATGCGTCTACCAGGGATCCACGGGGAGACTGGTGGAATGGCTGGGGAGTCTAGGGCTCCAGTGCCCGTCGTACCACAACCCGGCATCGTTCATCATCGAGGTGTCCTGCGGCGAGTACGGAGACAACACCGGCAAGCTGGTCCGAGCCATCGAGAACGGGAAAAATGATATCAGGACCGGGATGCCGCTTCCGAAGCCTCTGGAGTACAACAACAAACCGGATATGGAGGCTTCGCTGAAGAACGGCTGGGATAAGAACGACGCCTCACAGTTCAGAGACAAGGAGGCCAATGGGAACGGGAACACTAACGTACAGAACGGCATAGTGCAGTACAGCGACGTGGCCAGGGCGAAAGGTGACCTCCTGGTTCAAGTGGATACTGAGAAACAGGACAATGCGGAAGTTGCGCTTCTGGGAACGGAAGCTTCACCTAGAAGGTACGCCACCTCCGAGCTGGTGCAGTTCTGGGTGGTGCTGAAGAGGACCTTACTGTTCAGTAGGAGGGATTGGACCTTGATGTATCTGCGTCTGTTCGCCCACATCCTGGTAGGGTTCCTGATAGGGGCGCTATACTACGACATCGGTGACGACGGCTCCAAGGTCCTCTCGAACCTCGGCTTCCTGTTCTTCAATATGCTCTTCCTCATGTACACTTCGATGACCATCACTATACTTAGCTTCCCACTAGAGATGCCGGTGTTAGTGAAGGAGCATTTCAACAGATGGTATTCCCTCCGCTCATACTACCTGGCTATAACCGTCTCTGATATACCATTCCAGGCCATCTTCTGTATCATCTACGTGGTGATCGTGTACCTGCTGACGTCACAGCCCTTGGAGTGGTTCAGGTTCAGCATGTTCCTGTCCTCGTGTCTGCTGATAGCCTTCGTGGCTCAGAGCGTCGGGCTCGTAGTAGGGGCAGCCATGAATGTGCAGAACGGCGTGTTCCTGGCGCCGGTGATGTCTGTTCCGTTCCTCCTGTTCTCTGGTTTCTTCGTGTCTTTCAACGCCATCCCGGTGTATCTGAGGTGGATAACGTACGTGTCCTACATCAGGTACGGCTTCGAGGGAACCGCTCTAGCCACGTACGCCTTCAATAGGACCAAGCTACAGTGCCATCAGGTGTACTGTCACTTCAAAAACCCCGAGACGATCCTTGAAGAGCTGGACATGATGAACGCCGATTTCACACTGGACATCGCAGCGTTATGTCTCATATTCCTAGTACTGCGTGTATCAGCTTTCCTCTTCTTGCGTTGGAAACTCAGATCCACCAGATAA

Protein sequence:

>DPOGS212638-PA
MISDRKGSLKVTITPAQHKTLTHLPKRPPVDLAFTDLTYKVQEGRKSNVKTILKSVSGRLRSGELTAIMGPSGAGKSTLLNILTGYKTSGMEGSITVNGMERNLSSFRKLSCYIMQDNQLHGNLTVEEAMSVATALKLPSATTRDDKEEVIQEILETLGLSEHHKTMTSNLSGGQKKRLSIALELVNNPPIMFFDEPTSGLDSSSCFQCISLLKTLARGGRTIICTIHQPSARLFEMFDHLYTLADGQCVYQGSTGRLVEWLGSLGLQCPSYHNPASFIIEVSCGEYGDNTGKLVRAIENGKNDIRTGMPLPKPLEYNNKPDMEASLKNGWDKNDASQFRDKEANGNGNTNVQNGIVQYSDVARAKGDLLVQVDTEKQDNAEVALLGTEASPRRYATSELVQFWVVLKRTLLFSRRDWTLMYLRLFAHILVGFLIGALYYDIGDDGSKVLSNLGFLFFNMLFLMYTSMTITILSFPLEMPVLVKEHFNRWYSLRSYYLAITVSDIPFQAIFCIIYVVIVYLLTSQPLEWFRFSMFLSSCLLIAFVAQSVGLVVGAAMNVQNGVFLAPVMSVPFLLFSGFFVSFNAIPVYLRWITYVSYIRYGFEGTALATYAFNRTKLQCHQVYCHFKNPETILEELDMMNADFTLDIAALCLIFLVLRVSAFLFLRWKLRSTR-