Monarch geneset OGS2.0

DPOGS210042
TranscriptDPOGS210042-TA1902 bp
ProteinDPOGS210042-PA633 aa
Genomic positionDPSCF300017 - 1253946-1260104
RNAseq coverage359x (Rank: top 33%)
Annotation
HeliconiusHMEL0059090.082.53% 
BombyxBGIBMGA000472-TA0.074.14% 
DrosophilaCG11069-PA6e-15447.57% 
EBI UniRef50UniRef50_E0W1F27e-17649.85%ATP-binding cassette transporter, putative n=8 Tax=Pancrustacea RepID=E0W1F2_PEDHC
NCBI RefSeqXP_968555.10.057.25%PREDICTED: similar to AGAP002051-PA [Tribolium castaneum]
NCBI nr blastpgi|910910980.057.25%PREDICTED: similar to AGAP002051-PA [Tribolium castaneum]
NCBI nr blastxgi|910910980.057.25%PREDICTED: similar to AGAP002051-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055244.1e-07ATP binding
GO:00168874.1e-07ATPase activity
KEGG pathwaydpo:Dpse_GA107391e-151 
 K05683 (ABCG5)maps-> ABC transporters
InterPro domain[64-176] IPR0034394.1e-07ABC transporter-like
Orthology groupMCL11616 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210042-TA
ATGATTGGAAGTGACTACACATTGGAATTATGCAACGTCTTCCACTCTGGACAAGTGGAGCCAGGGAGCTTCTTCCAGCGCCTGACGGGCAGCGTCAAGACGGGTGTCATACTAAAAGACGTTTCCTTCATCACGCACAGTGGGGAAGTCACAGCCATACTCGGCTCCAAAGGTAGCGGTAAACGCGCTCTCCTGGATGTCATAAGCCGCAGGGTTCCTTCCAAAGGCCACGTCCTGCTGGAAGGTCTGCCGCTGGAAAAAGAACAGTTTATGAACACCTGCGCGCTGGTACGACACTCTACTAAACTGATGCCCGGCCTCACAGTTCAACAGACGTTATCATTGTCTCTGACTAAAATATCTGGATACTTGAAGGCTTCGAAGGTTAAGCAAGTAATGGCCGACTTAGCACTATCGCAGGTAGCAAACAAATGCGTAACGAGTCTAACTAAGAGCGAGTACAGGCGGCTTGTGATCGGGGTGCAACTCATAAGAGATCCGATTATTTTACTATTGGACGAACCGACTTGGGACTTGGATCCACTCAACACATACCTAGTGATATCCATACTGTCCAACGCTGCCAAGAAATACGGCACAACTATCATACTCACCATGGAGAAACCGAGATCTGATGTCTTCCCTTTCCTTGACAGGGTAGTTTACCTGTGTCTGGGTGACGCGGTGTACGCGGGACCCACTCGCGCTCTACTGGACTATTTCACCGGCATCGGTTTCCCGTGCCCGCAGCTTGAGAACCCACTTATGTATTACCTATGTCTGTCGACGGTTGACCGACGTTCTCGGGAACGGTTTATAGAATCGAACCATCAAATCGCAGCCCTGGTGGAGAAATTCAAAACCGAGGGTGTTCCTCACGAACATGGAAGGAGCAACCCTAACAAAATACAGATGAGCTATGGAAAGCCGAGCGGCGTGCGAGTTATATGGATGCTATATTTACGCACGCTCGCTTCAATATTCAATTTAAGGAAACACGGCATCAAGCAAATGTCCATGAGACTCCTGACATTGCCGATTTACTTTTTCATTCTTTGGATCTTCTACAACGACGCTAAGGACTATCAACGTGCTTTCATAACAAAAAGTGGCCTCATTTTCAACGCTATGACCGGCACATACTTCATCAGTATATTGAACACGATATGTTTGTTCGGTCCGTACCGGTCTCGTTACTACTGTGAAAGCGAAGCGGGCGTGTACTCCGGGGCGAGCGCCCTATTAGCCTGGTCACTAGTTTCCTTACCAGCCTCGCTACTAACAAGTCTCGCTGCAGCCGCCATAGTCTACCCGATACTGGGAGACATATCTGAGGGTGTGGCCTTCCTGCAGTTCGCTTTGATCCTGTGGTCGTGCTACATCTACGCTGAACAACAAACCATTGCTATCATGATGTTCGTTAAGAACGGACTCGTCACCGCCCTAATCAATATATACATCACCTGCGTCTACGTCATGCTCGCGAGTGGAGTGTTGAGATCTTACAAAGGCTACGAGGACTGGATGTTCTACCTGACATACTTGACACACACCCGGTACGCTTCAATATTCCTACACAGGAGTGTCTTCAAGCAACCCACGTTCAACATACTTCCGTACAGTGAGAATGAGAACTGCACGTCCATAACAAATCTCATACAGACATCATCCAACATGAACGCAAACTCCAACGCCAACTGTCGCTATCCCAGCGGTAAAGCCTTCTTAACAGAACGCTTCACGTACAAGAACTTCGCCGGCGACATCTATCAGAGCGGTGACTTTAATATGGAATTCAATTTAGGTATTTCCTTCGCATTCTCGTTGGGAATTATTATCCTTAACAAATTTCTATACTTAATACCGCTGCCGGGATATATTGTGGATAAATTTAGGGAATAG

Protein sequence:

>DPOGS210042-PA
MIGSDYTLELCNVFHSGQVEPGSFFQRLTGSVKTGVILKDVSFITHSGEVTAILGSKGSGKRALLDVISRRVPSKGHVLLEGLPLEKEQFMNTCALVRHSTKLMPGLTVQQTLSLSLTKISGYLKASKVKQVMADLALSQVANKCVTSLTKSEYRRLVIGVQLIRDPIILLLDEPTWDLDPLNTYLVISILSNAAKKYGTTIILTMEKPRSDVFPFLDRVVYLCLGDAVYAGPTRALLDYFTGIGFPCPQLENPLMYYLCLSTVDRRSRERFIESNHQIAALVEKFKTEGVPHEHGRSNPNKIQMSYGKPSGVRVIWMLYLRTLASIFNLRKHGIKQMSMRLLTLPIYFFILWIFYNDAKDYQRAFITKSGLIFNAMTGTYFISILNTICLFGPYRSRYYCESEAGVYSGASALLAWSLVSLPASLLTSLAAAAIVYPILGDISEGVAFLQFALILWSCYIYAEQQTIAIMMFVKNGLVTALINIYITCVYVMLASGVLRSYKGYEDWMFYLTYLTHTRYASIFLHRSVFKQPTFNILPYSENENCTSITNLIQTSSNMNANSNANCRYPSGKAFLTERFTYKNFAGDIYQSGDFNMEFNLGISFAFSLGIIILNKFLYLIPLPGYIVDKFRE-