Monarch geneset OGS2.0

DPOGS202910
TranscriptDPOGS202910-TA4932 bp
ProteinDPOGS202910-PA1643 aa
Genomic positionDPSCF300126 + 273131-291790
RNAseq coverage271x (Rank: top 40%)
Annotation
HeliconiusHMEL0117816e-17143.39% 
BombyxBGIBMGA004187-TA7e-15760.53% 
DrosophilaCG1718-PB1e-16127.08% 
EBI UniRef50UniRef50_UPI0000D566CF1e-17528.51%UPI0000D566CF related cluster n=1 Tax=unknown RepID=UPI0000D566CF
NCBI RefSeqXP_969271.12e-17628.51%PREDICTED: similar to ATP-binding cassette sub-family A member 3 [Tribolium castaneum]
NCBI nr blastpgi|910856074e-17528.51%PREDICTED: similar to ATP-binding cassette sub-family A member 3 [Tribolium castaneum]
NCBI nr blastxgi|1571332792e-17427.98%ATP-binding cassette sub-family A member 3, putative [Aedes aegypti]
Group
Gene OntologyGO:00055241.9e-20ATP binding
GO:00168871.9e-20ATPase activity
GO:00001667.8e-09nucleotide binding
GO:00171117.8e-09nucleoside-triphosphatase activity
KEGG pathwayrno:3029733e-163 
 K05643 (ABCA3)maps-> ABC transporters
InterPro domain[528-648] IPR0034391.9e-20ABC transporter-like
[513-704] IPR0035937.8e-09ATPase, AAA+ type, core
Orthology groupMCL17647 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202910-TA
ATGAATACGTTAGGAGTTCTTATGTGGAAACATATGGTGGTTAGGAAGAGAAGATTCATCCACACTACAGTTGACATATTATCTCCTTTAGCATTCTTTGTGCTCCTGTATTTATTTAAGGGATATATAACCTCTGGTAGGAGATCTGCGATGTCTGATGAGTTTATAGTCCAAAATACTGAACCAGTGGATTTAGACAAGTTGCAAGGGCCCACAGCTGTATTCTACAGCCCTGACACAGATCTCACTGGCCTCCTCATGGACCAGGTCGGTGAAAGCTTACACTTGCAGAGGAAGAAATACACTCCAGGTTATCTTGAGGAGTTTGGATACATGCCCTTCCAAAATCTATCTGATATACTTGATGCTAATAGGAAGCTAACAGATACAGATGCCATAGTATTGTTTGAAAACATGAACAGCACGTGGCCTGAACGACTAAACTATACCATCAGGATGAAAGGTGACTTCCAGACGAACAAAGTGACCGTCAACGACGAGTCCTTGGGGCCACACGAGAGCTTCGGTACGATATATGAGCCCTTCATGAGACTGCAGTGGGCCATAGACACCAGCTACCTCAAACTGTTATCCGGCTCCGACATCAAACAGCGTGTGACGATCCAAGAGTTTCCTTACGTCCGCCAGCAAGAAGTGCCAGCAATAAAGAACGTCTGCAACCTTCTGCCCTTCATCTGCTGGATATCCTTACTGCTGACATTCGTGTATGTAATGTCGAAGCTTCTGGAAGAAAGGATCACTGGTATTCAGGAGCTGATCAAAATGGCGGGAGTGTCAAACTTCCAGATATACCTGTCCCATTTCCTAAATATGTTGCCTGTCGGCGTGATTTTCTGTGTTTTTGGGACCCTCGTTATGACACTGACCGCCACTCCCATCATACCACAAACCAGCGCCTTTCTCATCATGATATTCTTGATTTTGCACTTTATGAACGTGATGTGTATGGCGTATTGTAGCAATTTCCTCATCACCAACACTCAGTACTCGACGTCAGTGGCGGCTGTGGTGTACATCGTCGCGGAACTTCCAATAAGCTTAATTGGAAAAAGCTATCCTACCTGGGCGCGGCCGATCGTAGGATTGTTACCCTTTATGCCCTTGCACTGGTTCTGGTGGGAGGTCGGCGAGATGGAAGCGTACGGGAAGGGCGCTGGATTCGGATCTATTGCGACGATCCACGACGCTGGATCTGGTAGTATCTTGGCCGCCTTCGCCTTCTTGCTGGTACAATCAGTGATTTTCCTCTTACTGGGCTGGTACCTGTCACTCATCAACCCTGGACCTTACGGACAACCGCTGCCCATTAACTTCCTCTGTAGCTCGAGTTTTTGGACTAAGAAGCAAGTTGTCCCTGAGGAGACGATCGAGGAGGAAACTGAACTCGCAGAAAGGCAGGACCCCGCGTACTTTGAAACTCCGCCCAAGGACATGTATCCCGGGATTAGAATCGTGAACGTGTCCAAGGTGTTCCCTAAACACCGCGCACTGAATAAAGTATCCTTGGACGTGTACCGCGGAGAGATCACAGTACTGCTGGGACACAACGGAGCCGGGAAGACTACGCTGATGTCAATAATAACGGGAATGATGAATGCGACTGAGGGTAAGGTATACGTGGAGGGGTACGACACGACCACCCAGAAGAGCCAGATGAGAAAGCTGCTGGGTCTGTGTCCACAACACAACCTGTTCTTCCCGGACCTCACCATACAGGAGCACGTGATTTTCTTCACCATGCTCAAGGGGAGCTCGTATCAAGAGGCGGGCCAGTCATCGGCGAAGCTGCTGCAACAGCTGGGGCTGGGAGACAAGATGTCGGCGAACAGCAGCGACCTGTCCGGGGGCATGAAACGTCGTTTGCAGCTGGCTTGCTCATTGGCGGGAGAAGCGGCTGTACTGATCCTGGACGAACCGACTTCTGGACTGGACGTGGAAACTCGCAGGGAACTCTGGGATCTGCTGTTGTCGCTCCGCGGCTCCCGCACAGTCCTGCTGTCGACTCACTTCATGGAGGAGGCGGATGCTCTCGGCGACCGCGTGGCAGCGCTGCACTCCGGCAGGCTGGTGTGCCACGCCACCACCATGCACCTCAAGAAGGCTATCGGGACCGGCTACCGTCTGTCGTGCATCACCGTGGGCGTCCCTAACGAGCCCGCCATCACCTCCCTCATCACGTCCTACGTGCCGGACGCGACCCTCAAAGAGCAGACCCTCAATTCCCTCTCGTACAACCTCCCATCGAAGGACACCAGCAAATTCCCAAAACTCTTCAACAGCTTAGAATCAAAAAAATCCGAATTGGGAATCAACTCTATCGGAGTCGGCATTTCGACGCTGGAGGAAGTGTTCTTGAAACTATGTAGCGACACAAGCGCTGGACTGACGCTCGACGAGGTGGACACGGGACCCAGTGAACCCCAGTACGAGGTCCTCACGGGTATGAACCTGTGCGCGCGCCAGTTCGTGGCGCTGGTGAGGAGACAGCTGAAATATTTAAACGCGAGGAGATCGATTTTTTTGACTGTGAGTGTACTGCTGCCGATCAGCATGATGTTACTAATGACCTTCGCATTGAACACCGACAAGGCAAAGGAACAGAGCGACGACAGCGTGGCCCTGGACCTGGACCTGTACACGCAGCCGCACAGGAGGGTCATGGTGAACGTGGAACCCGGAACCAACGTCAGGGCCCTCGGAGACTCCTACCCCACCGTGGACTTTGAACTCACCAGCGACGTGGCCGACGCGATCTTCCGAACCAGCAAGAAGGATGTGTTTGAATACAACAAATACCTGGTCGGCATCGAACTGAACGAGACGCACGCCAATGCGCTATACACGACGGTGGTGCGTCACGCGGCGCCCGTGTCCTTGAGCATGTTGTCCAACACCCTGGCGACCCTGCTGCTGCCGGCGGCGGACGGGCGGGTGCTCTCCACCCACAACGACCCTCTCCAGGAACGGAATTCCCAGCGCCTGATCCAGCCCAAGTCATCCACACACGCGATGCTGTGGGGTATCGTGATCTGCATCACCGTCTTGACCACCGCGGCCAACTACATGTCTCTGGCGTGCAACGAGCGGGCGTCGGGCACCCGTCACCTGCACGTGCTGTCCGGCTGCTCGGTGGAGATCCACTGGGCCGCGACTCTGTTGTGTCACCTGGTGCTGTGTATCGTCACCCTGGCCCTGCCCGCCAGCATCGCCCCGCTGCTCGACGAGGACAGCACCATCGACGCTCCGGAATTCATGGGCGCCGCGTTCGTCCTGTTGGTGTGCGCGTTGCTGTCGTTCCTATCCTTCACGTACTTCCTGAGCCTGTTCTTCAAAGAGAGCACCGCGGGGATCGTCGCGATGATCTGCCTTATACTATTCGGTTTCTTCACGCCGACTCTGAAGACGGCCACGGAGGCGGTGCAGCAGAATCTAGACAGCTTCTGGGACTACCTGGTGCTGTTGGTGAGCTACACCATGCCACCGCACACTTGCGTGCGCGGGTTCATCAAGGCCACGGACGACGCCTGCGTCAACGCCATGTGCAAGCTGAACAGACCAGACGGCTGCAAGCTGGAAGGACACCTCACCGGAGCCGACCTCGACAAATGCTGCGTACAAAACATAAACCCGAGATGTTACATGTGCTTCGACAAACACGCGCCCATGGCTGAGTGTGCCGTCCTGCTGGGGCAGTTCGTGTTCTACATGGCCTTGGTGATTATATGTGAGAACGGTATCCCGAATAAATTGAAAGAGATGATATTCAACTCTTCGTACAGACCCACCAGCACCTCCGCCACGACCATGGTGTCCGCTGAGAAACAGTACGCTGATGAGGCGATAGCACTGCCTCCCCGGGATATTCCTGACGCGGTGTTGGTCAGCAACATCTACAAGAGATACTTCTCTGTCCTGTGCAAGCCGTTCGTAGCCGTCAAGGGACTCAGCTTTTCGGTTAAGAGAGGCGAATGTTTCGGTCTACTCGGCGTGAACGGGGCGGGCAAGTCCACCACCTTCAAGATGCTGGCCGGCCTCGAGTACCCCTCCAAAGGATCCATCTTCGCCAACGGACAGTTCATGAGCCGCTCGAGCAGCAAGTACCTCCACTCTCTGGGCTACTGTCCCCAGTTCTTCGGTCTGGACTCGTTCCTGTCGGGTCACGATAACCTGGCTTTACTGCTAACCCTCAAGGGACTCAGCCAGGACGACGTGGAAAGAGAAGTTGACACGTGGATCAGGATCGTGGGTCTCGAGCGCTACGCCCTCCAGGCCGTGTCGGGGTACTCCGGGGGCTGTGCCCGCCGTCTGTCCGCGGCGGCGTCGCTGTGCCCGGGAGCTCCGGTGGCGCTGCTGGACGAGCCCACGGCCGGCGTGGACGTAGCGGCCAGGAGACGAGTGTGGACCGCGCTCAGGAGGGCCGCGCCTAACAGAGCCATCATCATCACCTCGCATAGCATGGACGAGATGGAGGCGCTGTGCAGCCGCATCGGCATAATGGTGGCCGGTCGCCTGAGGGCGCTGGGCTCGGCCGCGGAGCTGCGCGCGACACACGCCTCGGGACACGCCGTGCGCCTCAAGCTGTCGTCCCCCCTCACGGACGCTGACGTACATATAACGCTGGCGTTGTCCCGACCCGCAGAGACCGACAGCACCGTGTCGGACATCGCGCGACTCAAGGCGACGCTCCACGACATGTTCGAGTGCACGCTGAGGGACGAGCACAAGACGATGCTGCACTATCACATAAACGAAACTCTACGATACAGCCAGCTGTTCACTCAGCTGGAGCAGCTCAAGAGGGACTTCCCCTCCCTGGTGGAGGACTACGACATCACGGAGACAACGCTCGAGGAAGTGTTCCTCACACTGGCTCGGGAACAGGAGGAGGAGGCGTATGAAGCGAGAGTCTAG

Protein sequence:

>DPOGS202910-PA
MNTLGVLMWKHMVVRKRRFIHTTVDILSPLAFFVLLYLFKGYITSGRRSAMSDEFIVQNTEPVDLDKLQGPTAVFYSPDTDLTGLLMDQVGESLHLQRKKYTPGYLEEFGYMPFQNLSDILDANRKLTDTDAIVLFENMNSTWPERLNYTIRMKGDFQTNKVTVNDESLGPHESFGTIYEPFMRLQWAIDTSYLKLLSGSDIKQRVTIQEFPYVRQQEVPAIKNVCNLLPFICWISLLLTFVYVMSKLLEERITGIQELIKMAGVSNFQIYLSHFLNMLPVGVIFCVFGTLVMTLTATPIIPQTSAFLIMIFLILHFMNVMCMAYCSNFLITNTQYSTSVAAVVYIVAELPISLIGKSYPTWARPIVGLLPFMPLHWFWWEVGEMEAYGKGAGFGSIATIHDAGSGSILAAFAFLLVQSVIFLLLGWYLSLINPGPYGQPLPINFLCSSSFWTKKQVVPEETIEEETELAERQDPAYFETPPKDMYPGIRIVNVSKVFPKHRALNKVSLDVYRGEITVLLGHNGAGKTTLMSIITGMMNATEGKVYVEGYDTTTQKSQMRKLLGLCPQHNLFFPDLTIQEHVIFFTMLKGSSYQEAGQSSAKLLQQLGLGDKMSANSSDLSGGMKRRLQLACSLAGEAAVLILDEPTSGLDVETRRELWDLLLSLRGSRTVLLSTHFMEEADALGDRVAALHSGRLVCHATTMHLKKAIGTGYRLSCITVGVPNEPAITSLITSYVPDATLKEQTLNSLSYNLPSKDTSKFPKLFNSLESKKSELGINSIGVGISTLEEVFLKLCSDTSAGLTLDEVDTGPSEPQYEVLTGMNLCARQFVALVRRQLKYLNARRSIFLTVSVLLPISMMLLMTFALNTDKAKEQSDDSVALDLDLYTQPHRRVMVNVEPGTNVRALGDSYPTVDFELTSDVADAIFRTSKKDVFEYNKYLVGIELNETHANALYTTVVRHAAPVSLSMLSNTLATLLLPAADGRVLSTHNDPLQERNSQRLIQPKSSTHAMLWGIVICITVLTTAANYMSLACNERASGTRHLHVLSGCSVEIHWAATLLCHLVLCIVTLALPASIAPLLDEDSTIDAPEFMGAAFVLLVCALLSFLSFTYFLSLFFKESTAGIVAMICLILFGFFTPTLKTATEAVQQNLDSFWDYLVLLVSYTMPPHTCVRGFIKATDDACVNAMCKLNRPDGCKLEGHLTGADLDKCCVQNINPRCYMCFDKHAPMAECAVLLGQFVFYMALVIICENGIPNKLKEMIFNSSYRPTSTSATTMVSAEKQYADEAIALPPRDIPDAVLVSNIYKRYFSVLCKPFVAVKGLSFSVKRGECFGLLGVNGAGKSTTFKMLAGLEYPSKGSIFANGQFMSRSSSKYLHSLGYCPQFFGLDSFLSGHDNLALLLTLKGLSQDDVEREVDTWIRIVGLERYALQAVSGYSGGCARRLSAAASLCPGAPVALLDEPTAGVDVAARRRVWTALRRAAPNRAIIITSHSMDEMEALCSRIGIMVAGRLRALGSAAELRATHASGHAVRLKLSSPLTDADVHITLALSRPAETDSTVSDIARLKATLHDMFECTLRDEHKTMLHYHINETLRYSQLFTQLEQLKRDFPSLVEDYDITETTLEEVFLTLAREQEEEAYEARV-