Monarch geneset OGS2.0

DPOGS200379
Transcript          DPOGS200379-TA    2559 bp
Protein             DPOGS200379-PA    852 aa
Genomic position    DPSCF300026 + 1090740-1100432
RNAseq coverage     400x (Rank: top 30%)
Annotation
Heliconius        HMEL005382          0.0       82.59%
Bombyx            BGIBMGA007218-TA    1e-120    50.98%
Drosophila        CG1718-PB           1e-142    34.82%
EBI UniRef50      UniRef50_Q7PZY9     2e-154    41.55%    AGAP012155-PA n=9 Tax=Culicidae RepID=Q7PZY9_ANOGA
NCBI RefSeq       XP_001653234.1      2e-165    39.39%    ATP-binding cassette sub-family A member 3, putative [Aedes aegypti]
NCBI nr blastp    gi|312372904        2e-164    39.11%    hypothetical protein AND_19498 [Anopheles darlingi]
NCBI nr blastx    gi|158300460        1e-163    39.10%    AGAP012156-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology      GO:0005524    5.6e-13    ATP binding
                   GO:0016887    5.6e-13    ATPase activity
                   GO:0000166    3.6e-05    nucleotide binding
                   GO:0017111    3.6e-05    nucleoside-triphosphatase activity
KEGG pathway       mdo:100013281    1e-124
                   K05643 (ABCA3) maps-> ABC transporters
InterPro domain    [574-694]    IPR003439    5.6e-13    ABC transporter-like
Orthology group    MCL10087    Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200379-TA
ATGAGGAATAAATTAGCTTCTTTCATACAAATAATAACACCGATAATAAATATATCAATATCCGTGTGGATAGCTCGTTCTTGGAAATTCATGTCCCAATTACCACCATTGGAACTCAGTTTGGAGAGTGGCTTTAGAAAAACTGTCACTTTAGTATCAGAAGGGACGAACTTAACTGATAACAGCATAGAAAGGAGAGCGATGATGGCTTATAAGGACTATTTCAAAAGCAGTTCAGATCCGACAATGTTATTGACTGATATCGGAAGATTGGATTTATCCAAGTTCTATTTGAAATTGCTTCAAGCGGATTTGCCAAGGGTTCGTTATGAGAACTTAGTTGGGGCTACGTTCGCACCTCAACGTATAACAGCGTGGTTCAGTAACTATGGTTACCACGACTCGGCTATATCACTCGCCATGGCCAATAATGCCATCATGGGAGCTCTATCACCAGGGAGCTCCTTAAAATTTATCAACCATCCCCTGCCCTACTCCATCGAAAATTTGCGTAATGTAATTTCGAAGCATCTATCTGGAGGTCAGAAGCGTCGTTTGTCAGTGGGGGCGGCGATGTGTGGCTCATCTAGAGTGGTACTATTAGATGAACCAACATCCGGTCTAGATCCAGCCGCTAGACGATCGCTATGGGACCTACTGCAGAGGGAAAAGAAAGGTCATATAAAATGTCTCATACCGACCTCGTTCCAGGTCCGCGTGATGGCGAGCGGCAGCAGTATGGGCTTCCAGTTCGCATTTAATATTGGATTCTGTATGGCTTTTGTTACATCTTTTTTGGTTCTCTTCGCTATTAAGGAACGTGTAAGTGGCGCGAAGCTCCTCCAGCGGGTGTCGGGAGTACGTCCCGCAATAATGTGGACAACCGCCCTCATATGGGATTGGTTCTGGCTGTTCATAGTGTTCATAGCGATCATAGTCACACTTGGACTCTTCCAGGAGAACACACTAGCGACACCGGCTGAATTAGGGAGGGTGATGTTGGTGCTGATTATATTCGCCTTTGCAATGATCCCGTTGCACTATTTAGCGTCTTTTTATTTCGAAGCATCAGCGACTGGTTTCTCAAAAATGTGCTTCATCAATATATTTTCAGGTTGTATGCCTTTCTTAATAACGGAAGTGCTGAGGTTACCGGAAGTCGGTAATCCGTACTACGCCCATATATTTGACTGGGTCTTCTCGCCTTTACCCATATACTGTATCAGTAGGAGCTTCAGGGATATGAGCGTGTCTGCGTTCTCTCTGCTGGCGTGTGACGCTCTCTGCGCCCAGCTCCCCGGTGTGAACTGTACCCGGTTCACGGTCTGCACGAAACTCAACGTCTCGGTTTGTTGTATGGAGGACGATCCCTTCCTGAGATGGAGTGAACCAGGCATCGGTCGTTATCTCTTCACGATGACTCTAGTCGGACTAATCTCGTTCACGATACTACTTATAAAAGAATACGAAATATTGAATAAGGTATTCTATTCCGAGAGCAAGCACGGCCTACCTGCTTTAGTAGCTGATGAAGACAGCGATGTTGCTAACGAAAGACAAACCGTGAGGGCGTTCACTAGAAACGAATTGACGCAGCACAGTCTCGTGTGTAGAGACTTAACTAAGTACTATAAGGACTTCCTGGCGGTCAACAGACTTAGTTTCGCGGTACATAAGGGTGAATGCTTCGGCCTTTTGGGGATTAACGGCGCGGGGAAGACCAGCACGTTCAGAATGCTTACCGGAGACTCCAGGCTGAGTTGTGGAGACGCTTACGTACACGGACTGTCGCTCAAGACGCGTATACAGGATGTTCACCGACACATCGGATATTGTCCGCAATTCGACGCGTTGTTGGAGAATCTAACAGCACGAGAAACTTTAAAGATATTCTGTCTCCTGCGTGGAATACCTGTTAAAGTTGGATCCGCGAGGGCCATACAGTTGGCAGAAATGTTGGGATTCTTCAGACATTACGATAAAAAGGTTCACGAATGTAG
TGGTGGCACTAAAAGGAAAATTAGTACAGCACTAGCACTCCTTGGGGATTCACCTCTAGTGTTCCTTGACGAACCAACCACAGGCATGGATCCTGCCTCAAAGCGTTTAGTGTGGCGCTGTGTTAGCGAGGCAGCGGCCGGAGGAAGAAGCGTGGTCCTTACGTCGCACAGTATGGAAGAATGTGAGGCGCTGTGTTCGCGTTTAACGGTCATGGTAAATGGCCAACTGTACTGCCTCGGACCTCTTCAGCATCTCAAGAATAAATTCTCACAAGGTTACACACTTATCGTGAAATGTTCTTCCGGCGCAGACAGAGATGCCACTGTAGCGAAAATAAACCAATACGTCACGGACAACTTTCGGGACGCTAAACTTATTGAGACGTACCTGGGCATAAGTACTTATTATCTGAACGACCAAGACCTTCCGTGGTGGAGAGTTTTTCATCTCATGGAAGAAGCCAGAAGCCAGTTCCCCATAGAAGACTATTCTGTATCTCAGACCACGCTGGAGCAAGTGTTCCTTCGCTTCACCAGGAATCAGGGTCGAGGGGATTAG

Protein sequence:

>DPOGS200379-PA
MRNKLASFIQIITPIINISISVWIARSWKFMSQLPPLELSLESGFRKTVTLVSEGTNLTDNSIERRAMMAYKDYFKSSSDPTMLLTDIGRLDLSKFYLKLLQADLPRVRYENLVGATFAPQRITAWFSNYGYHDSAISLAMANNAIMGALSPGSSLKFINHPLPYSIENLRNVISKHLSGGQKRRLSVGAAMCGSSRVVLLDEPTSGLDPAARRSLWDLLQREKKGHIKCLIPTSFQVRVMASGSSMGFQFAFNIGFCMAFVTSFLVLFAIKERVSGAKLLQRVSGVRPAIMWTTALIWDWFWLFIVFIAIIVTLGLFQENTLATPAELGRVMLVLIIFAFAMIPLHYLASFYFEASATGFSKMCFINIFSGCMPFLITEVLRLPEVGNPYYAHIFDWVFSPLPIYCISRSFRDMSVSAFSLLACDALCAQLPGVNCTRFTVCTKLNVSVCCMEDDPFLRWSEPGIGRYLFTMTLVGLISFTILLIKEYEILNKVFYSESKHGLPALVADEDSDVANERQTVRAFTRNELTQHSLVCRDLTKYYKDFLAVNRLSFAVHKGECFGLLGINGAGKTSTFRMLTGDSRLSCGDAYVHGLSLKTRIQDVHRHIGYCPQFDALLENLTARETLKIFCLLRGIPVKVGSARAIQLAEMLGFFRHYDKKVHECSGGTKRKISTALALLGDSPLVFLDEPTTGMDPASKRLVWRCVSEAAAGGRSVVLTSHSMEECEALCSRLTVMVNGQLYCLGPLQHLKNKFSQGYTLIVKCSSGADRDATVAKINQYVTDNFRDAKLIETYLGISTYYLNDQDLPWWRVFHLMEEARSQFPIEDYSVSQTTLEQVFLRFTRNQGRGD-
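
The transcript and protein records above can be checked against each other. The following is a minimal Python sketch, not part of the database record, which assumes the standard genetic code and that the transcript is a pure coding sequence ending in a stop codon. The names `translate` and `expected_cds_length` are hypothetical helpers introduced for illustration. Under these assumptions, an 852 aa protein corresponds to 3 x (852 + 1) = 2559 bp, which matches the listed transcript length, and the first and last codons of the nucleotide sequence translate to the start and end of the protein sequence (the record writes the stop as "-", the sketch writes it as "*").

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (the third base varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def expected_cds_length(protein_length: int) -> int:
    """Length in bp of a CDS encoding `protein_length` residues plus one stop codon."""
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # First six codons of DPOGS200379-TA, copied from the record above.
    print(translate("ATGAGGAATAAATTAGCT"))   # MRNKLA
    # Last three codons of DPOGS200379-TA, ending in the TAG stop.
    print(translate("GGGGATTAG"))            # GD*
    # 852 aa protein -> expected transcript length in bp.
    print(expected_cds_length(852))          # 2559
```

To check the whole record, the full nucleotide sequence (with its line break removed) can be passed to `translate` and compared with the protein sequence after replacing the trailing "-" with "*".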