Monarch geneset OGS2.0

DPOGS212609
TranscriptDPOGS212609-TA1878 bp
ProteinDPOGS212609-PA625 aa
Genomic positionDPSCF300245 + 50022-72618
RNAseq coverage519x (Rank: top 24%)
Annotation
HeliconiusHMEL0067806e-14242.46% 
BombyxBGIBMGA005094-TA5e-12737.50% 
DrosophilaCG31689-PA3e-14441.61% 
EBI UniRef50UniRef50_E0VP176e-15948.34%ABC transporter, putative n=5 Tax=Neoptera RepID=E0VP17_PEDHC
NCBI RefSeqXP_971735.12e-18051.57%PREDICTED: similar to abc transporter [Tribolium castaneum]
NCBI nr blastpgi|910817233e-17951.57%PREDICTED: similar to abc transporter [Tribolium castaneum]
NCBI nr blastxgi|910817236e-17351.48%PREDICTED: similar to abc transporter [Tribolium castaneum]
Group
Gene OntologyGO:00055244.5e-239ATP binding
GO:00160214.5e-239integral to membrane
GO:00160208.6e-27membrane
GO:00168871.4e-15ATPase activity
GO:00001664.7e-14nucleotide binding
GO:00171114.7e-14nucleoside-triphosphatase activity
KEGG pathwaymmu:1926632e-125 
 K05680 (ABCG4)maps-> ABC transporters
InterPro domain[29-623] IPR0200644.5e-239ABC transporter, G1-like
[351-559] IPR0135258.6e-27ABC-2 type transporter
[84-215] IPR0034391.4e-15ABC transporter-like
[69-263] IPR0035934.7e-14ATPase, AAA+ type, core
Orthology groupMCL17074 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212609-TA
ATGACTACGAGCTTGACGCTTTTGCAAGGATTTAGTCCGAAATTACGTGGAAGGGAACAGAATGAATGCCGAACTCTGCAAATAAACATGCCGCCTCCTGTAAACGTTGATATTGAATTCAAAGGTGTCACAATGGAAGCTACAACTGGATTCATAAAAAAAAATAAAAAAACTATTTTAAAAAATATTTCCGGAGATTTTAAGTCCGGTGAGCTAACGGCCATAATGGGACCGTCTGGAGCCGGCAAAACATCACTCCTTAACGCCCTTACAGGATATTCGTTAAAAGGGGTGACGGGTGTGATCCGAGCTGGCAACAGCGTATGTGAATTTGACAGCACTAATTCATATCAAACCCTGAACGCGTACCGTAAGAAGTCGTGCTACATACTGCAAGACGATAAGCTGAACCCCTTATTTTTGGTCTCCGAGCAAATGCAATTCGCTGCGGACTTAAAGTTAGGCGATACATTTACGCAGAAGTTAAAAAACTCAGTTATATCAGATGTGCTGAAGACCTTGGGTCTGACGGGAACTGAGAACACTCCTTGTAGTAAACTATCAGGTGGACAGAGAAGAAGACTCTCCATCGCCGTGGAGCTAATTGACAATCCACCAGTTTTATTTCTTGACGAGCCTACGACAGGTCTAGACAGTGTCTCATCCAAACAATGCATGGAATTATTACAAAATCTCGCACGAGTTGGTCGGACTATAATCTGCACAATTCACCAACCGTCTGCTACTATATATAAAATGTTCGATCAGGTATACGTGTTGGCAGAAGGTCAGTGTGTCTATCAAGGTCCCAGTTCTCACACAGTCCCTCATCTGGCTAGTCTTGGTCTCGTCTGTCCCAAGTACCACAATCCAGCGGATTACATTTTAGAACTAGCAAATGGCGAACATGGTCAACTCAATGAACTTCTCACAGCTAACTGTGTTCCTGAAAAACTAAATGAACGTATTCCGGAATTGCCCATAGAGAATCGTCCAGCACTTTCAAATGAAAAAATGACCATAGTAATAAATAGACCTTACGAATTTTACAAGTTTAAAGTGCTGTTCAAGAGGTGTGTCGTGCAACAGTGCCGTGATTGGACAGTAGTTCCCCTTAGGATGGTCATCCATGTAATCATCGGTATTATGTTGGGACTGTTCTTCAATAACGTCGGCAATGATGCTTCCAGAACACTCAGTAACCTTGGCTTCCTCATTATATCACCTACCTACCTCTGTTATACATCCCTCATACCAGCAGTGCTCAGATTTCCCGATGAGTTACCAGTGTTGAAGAAAGAACATTTTAATAACTGGTATAACCTCAAAACTTATTATATAGCAGTCCTAGTGACGAACACACCAATACAGATCTGCTACAGTTTAATATACTCAATACCAGCGTATCTTCTCAGCGGACAGCCGATAGAGTTACATCGTTTCGCGATGTTCGTTCTCATTCTATCAAATGTATCTCTCATAGCAGACACTATAGGCATATTCATTGGTTCTTGCGTCAATCCTATTAATGGTACATTCCTGGGAGCGATAACGACCTGCGTGATGATAGTTTTCGCTGGTTTCCTGGTGATACCATCTCATATGCCGTCGATACTCCAACCTGTGGCGCATGGGTCTTTCCTGAAACAGGTGTACGAAGCGCTTACACTCTCTATGTACTCCTTCGATAGAACACCACTAGCGTGTGGTGAAGATAAGATATACTGTCACCTCAAATATCCCCAAACGATCCTCAAAGAACTAGATATGCAGCCTGGAAACTATTGGCAGGACGTGTGCATTATTCTTGTAGAATTGTTACTACTTAGGATAATTGCATTTCTCTCGTTGAAGCGTAGAGTTAAAAAGTGTAATTAA

Protein sequence:

>DPOGS212609-PA
MTTSLTLLQGFSPKLRGREQNECRTLQINMPPPVNVDIEFKGVTMEATTGFIKKNKKTILKNISGDFKSGELTAIMGPSGAGKTSLLNALTGYSLKGVTGVIRAGNSVCEFDSTNSYQTLNAYRKKSCYILQDDKLNPLFLVSEQMQFAADLKLGDTFTQKLKNSVISDVLKTLGLTGTENTPCSKLSGGQRRRLSIAVELIDNPPVLFLDEPTTGLDSVSSKQCMELLQNLARVGRTIICTIHQPSATIYKMFDQVYVLAEGQCVYQGPSSHTVPHLASLGLVCPKYHNPADYILELANGEHGQLNELLTANCVPEKLNERIPELPIENRPALSNEKMTIVINRPYEFYKFKVLFKRCVVQQCRDWTVVPLRMVIHVIIGIMLGLFFNNVGNDASRTLSNLGFLIISPTYLCYTSLIPAVLRFPDELPVLKKEHFNNWYNLKTYYIAVLVTNTPIQICYSLIYSIPAYLLSGQPIELHRFAMFVLILSNVSLIADTIGIFIGSCVNPINGTFLGAITTCVMIVFAGFLVIPSHMPSILQPVAHGSFLKQVYEALTLSMYSFDRTPLACGEDKIYCHLKYPQTILKELDMQPGNYWQDVCIILVELLLLRIIAFLSLKRRVKKCN-