Monarch geneset OGS2.0

DPOGS200174
TranscriptDPOGS200174-TA2115 bp
ProteinDPOGS200174-PA704 aa
Genomic positionDPSCF300128 + 432714-447752
RNAseq coverage118x (Rank: top 58%)
Annotation
HeliconiusHMEL0210840.064.17% 
BombyxBGIBMGA002924-TA0.079.42% 
Drosophilast-PA0.051.76% 
EBI UniRef50UniRef50_F1SZW30.074.12%Scarlet n=1 Tax=Bombyx mori RepID=F1SZW3_BOMMO
NCBI RefSeqXP_968696.10.051.85%PREDICTED: similar to scarlet [Tribolium castaneum]
NCBI nr blastpgi|3796989020.074.12%scarlet [Bombyx mori]
NCBI nr blastxgi|3796989020.074.44%scarlet [Bombyx mori]
Group
Gene OntologyGO:00160209.4e-30membrane
GO:00001661.1e-13nucleotide binding
GO:00171111.1e-13nucleoside-triphosphatase activity
GO:00055241.3e-11ATP binding
GO:00168871.3e-11ATPase activity
KEGG pathwaytgu:1002219846e-74 
 K05681 (ABCG2)maps-> ABC transporters
InterPro domain[399-608] IPR0135259.4e-30ABC-2 type transporter
[106-298] IPR0035931.1e-13ATPase, AAA+ type, core
[121-250] IPR0034391.3e-11ABC transporter-like
Orthology groupMCL16669 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200174-TA
ATGCGATCCGTCACCAAGACCGACACCGATGGCAGTCCGTCCACCGGATCCTCGGGGAATTCGAGCCCGAATAGTTTAAACGACGAACACGCCGACGATCCCAATGAACTACCTGGAGGTATATTATACGTAGATAGTGGAATAACCTATCCGAAATACTATGATGAGCCGTACTTCGATGAGGTGGAGGAGCTCTTGGGTTCCTCATCCTCACCAGCTCCCTGTACCCTCGTGTGGAGGGACGTCACCGTACATATCAAACTGAAAAATGGAAAGCTCAAGAGACTAGTTAATAATGTGAGCGGCATCGCGAAACCCGGAACTCTAGTGGCCCTCATGGGACCTAGCGGTGCCGGCAAGACAACCTTGATGACCGCGTTGGCCCAGCGGAGTCCAGACGATACAATCGTGGATGGTGCAATTGCTATGAATGGCATGCCCATAGGAGACTTCATGCATCGAGAGAGCGGGTACATGCATCAGGACGAGCTGTTCGTTGAGAATCTAACAGTTATGGAACACCTCACTATAATGTCTCGTTTGAAAATGGACCGTCGTACCTCTCCTTTGGCTAGACGGCGAAAAGTCAATCAGTTATTAAGACAGCTGTCCCTCTACGGAGCCAGACATACGAGGATAGGTGGCTTGGACGGTTTAAAGACCCTGTCCGGGGGCGAAAGGAAACGACTGGCTTTTGCTACTGAGTTACTAACCGACCCGGGGCTTCTGTTCTGCGATGAGCCCACTACCGGCCTGGACTCGTCGTCAGCACAAAAGCTCATAACTTTATTACGGGCCAGTGCGGTTCAAGGCAAGACCATCATATGCACGATACATCAACCGTCCTCTGAACTTATGGCCTTGTTTGATAAATTGGTTTTGCTCGCCGAAGGACGAGTCGCGTTCGCTGGAAATGCCTCCGGAGCTCTAAGTTTCTTTGAAAGTCTGGGCTATCAATGCCCCATAACCTACAATCCCACCGACTATTTCATCAAAGTCTTGGCGTTAACCCCTGGATCTGAGGGTGCATCGAGACAGGCAATCAAAAGCGTTTGTGACAGATTCGCTGTCAGTGATGCAGCCAAAGAGTTGGACATGGAGATCCATTTAGAGTTCCACATTATGGAGAACGAAGATGAGGAGTCAAAGAAATTAAAATTCACGCATTACAAGTCGCCGTTCATTCATACTAAAATAGCCTGGCTCGTGTATAGATACCTGCTGATCATCGTGAGAGATCCCAGAGTTCAACTAGTGCGAATTATACAAAAACTGGCTATAGCGATAACAGCGGGGCTGTGTTTCCTTGGAACAGCTCGCCTAACTCAGGCTGGTATACAGGATGTTCAAGGGGCGCTGTTCATCATCATCGCTGAGAATACTTTCATTCCGATGTACTCTGTGCTAAACATGTTCCCTGAAGAGTTTCCCCTCCTCCAGAGAGAGCTCAAGGCTGGACTGCACTCTACGACTATATATTATGTATCAAGAATGCTCGCTTTGTTGCCAGGGTTAGTGATCGAGCCCACGTTGTTCACTCTAGTGGTGTACTGGGTCGCGGGGTTGCGGGCTACGCTCTATGCGTTCGGCTTTACCGTTTTGCTCGCCATACTAGTACTGAACGTGGCGATCGCTTGTGGATCATTCTTTTCATGCGCTTTTGGTTCTATGCCGCTGGCCATCGCTTACCTCGTCCCCTTCGACTATTCCCTCATGATGACTTCAGGATTATTTATTAAACTGAGCTCCATGCCGAAATACGTGTCCTGGATACGATACATGTCCTGGCTGATGTACTCCAACGAAGCTATGAGCATCCTTCAGTGGGATGGAGTCCAGAACATAACGTGTACATTACCAGAGAATGAAGCGCCATGTGTGAGTTCTGGCCAGGAGGTCCTGCAGGTGTACGACTTCGAGAAGACTAAGTTCTGGATTGACATAATGGCCCTGGTGATAATGTACCTGACTTTCCACCTGCTGGCGTTACTCGCGCTCCGAAGTTTTGTTAGTCGTGGGATTCAACAGTTTATTTCTCTGTTGGCGCTAAGCGATGCAATAGCGGGCTATCGTGCCGCGGGAATGCAAGCCTCCCTGTCTCTCGGCGGGAAGTAA

Protein sequence:

>DPOGS200174-PA
MRSVTKTDTDGSPSTGSSGNSSPNSLNDEHADDPNELPGGILYVDSGITYPKYYDEPYFDEVEELLGSSSSPAPCTLVWRDVTVHIKLKNGKLKRLVNNVSGIAKPGTLVALMGPSGAGKTTLMTALAQRSPDDTIVDGAIAMNGMPIGDFMHRESGYMHQDELFVENLTVMEHLTIMSRLKMDRRTSPLARRRKVNQLLRQLSLYGARHTRIGGLDGLKTLSGGERKRLAFATELLTDPGLLFCDEPTTGLDSSSAQKLITLLRASAVQGKTIICTIHQPSSELMALFDKLVLLAEGRVAFAGNASGALSFFESLGYQCPITYNPTDYFIKVLALTPGSEGASRQAIKSVCDRFAVSDAAKELDMEIHLEFHIMENEDEESKKLKFTHYKSPFIHTKIAWLVYRYLLIIVRDPRVQLVRIIQKLAIAITAGLCFLGTARLTQAGIQDVQGALFIIIAENTFIPMYSVLNMFPEEFPLLQRELKAGLHSTTIYYVSRMLALLPGLVIEPTLFTLVVYWVAGLRATLYAFGFTVLLAILVLNVAIACGSFFSCAFGSMPLAIAYLVPFDYSLMMTSGLFIKLSSMPKYVSWIRYMSWLMYSNEAMSILQWDGVQNITCTLPENEAPCVSSGQEVLQVYDFEKTKFWIDIMALVIMYLTFHLLALLALRSFVSRGIQQFISLLALSDAIAGYRAAGMQASLSLGGK-