Monarch geneset OGS2.0

DPOGS201401
TranscriptDPOGS201401-TA1869 bp
ProteinDPOGS201401-PA622 aa
Genomic positionDPSCF300083 + 580048-586406
RNAseq coverage2071x (Rank: top 6%)
Annotation
HeliconiusHMEL0128510.082.93% 
BombyxBGIBMGA002004-TA0.094.21% 
DrosophilaCG9281-PB0.083.60% 
EBI UniRef50UniRef50_Q9UG630.069.22%ATP-binding cassette sub-family F member 2 n=123 Tax=Eukaryota RepID=ABCF2_HUMAN
NCBI RefSeqNP_001040334.10.094.05%ATP-binding cassette sub-family F member 2 [Bombyx mori]
NCBI nr blastpgi|1140530010.094.05%ATP-binding cassette sub-family F member 2 [Bombyx mori]
NCBI nr blastxgi|1140530010.094.05%ATP-binding cassette sub-family F member 2 [Bombyx mori]
Group
Gene OntologyGO:00055241.1e-16ATP binding
GO:00168871.1e-16ATPase activity
GO:00001663.8e-08nucleotide binding
GO:00171113.8e-08nucleoside-triphosphatase activity
KEGG pathway 
InterPro domain[436-544] IPR0034391.1e-16ABC transporter-like
[421-589] IPR0035933.8e-08ATPase, AAA+ type, core
Orthology groupMCL14270 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201401-TA
ATGCCTTCGGACGCAAAGAAACGAGCCCAGCAAAAGAAAAAAGAACAAGCTACTAATAGAAACAAGAAGCCTGTGGCAAGAGTGACGACTGCAGAGACGAATGGTTCTACAAATGGTGCGGTAAAAGAAGATCGAGAGCTCACTGCTGAAGAGGCACTATGTCTGAAGCTGGAGGAAGAAGCGAAAATGAATTCTGAAGCCCGGTCCTGCACTGGTTCATTGGCAGTTCATATAAGATCAAGGGATATAAAGATTGCAAATTTTTCCATCACTTTTTATGGCAGTGAATTGCTACAAGACACATTGCTAGAGTTGAACTGTGGTCGAAGATATGGCCTTGTTGGTCTTAATGGCTGTGGTAAGTCATCATTGCTGGCTGTGTTGGGTCGGCGGGAGGTACCAATACCAGACCACATTGATATTTTCCATTTAACTCGTGAGATGCCAGCATCAGATAAAACAGCCTTGCAATGCGTTATGGAAGTTGATGAAGAGAGAATCAAGTTGGAGAGACTTGCTGAGGAACTCGCCCAGTGTGATGACGATGAATCCCAGGAACAGCTGTTGGATGTTTATGACCGTCTAGATGATTTAAGCGCTGACACTGCCGAGGCTAGAGCAGCCCATATTCTTCATGGTCTAGGATTCAGCAAGGAGATGCAACAAAAGGCGACCAAGGATTTCTCAGGAGGTTGGCGTATGAGAATTGCTTTGGCGCGTGCGTTATATGTGAAGCCACATCTGCTTTTGTTGGACGAGCCAACTAACCATCTTGATTTAGACGCCTGTGTGTGGCTTGAAGAGGAACTCAAACAATATAAGAGAATCCTAGTGCTGATATCGCACTCTCAAGATTTTCTGAATGGAGTTTGTACCAATATAATTCACATGAGCAAGCGTAGGCTGAAGTACTACACTGGTAACTATGAGGCCTTCGTAAGGACCCGCATGGAGTTGTTGGAGAACCAGATGAAACAGTACAATTGGGAACAAGATCAGATCGCTCATATGAAGAACTACATAGCTCGGTTTGGTCACGGTTCAGCTAAACTCGCTCGTCAGGCCCAGTCAAAGGAGAAGACACTAGCCAAGATGGTAGCTCAAGGTCTTACTGAGAAGGTGACGGATGATAAGATACTCAACTTCTACTTCCCGTCATGCGGGAAAGTACCACCGCCCGTTATCATGGTTCAGAATGTTTCATTCCGCTACACAGACAGTGGACCTTGGATATATAAGAATCTGGAGTTCGGTATAGATCTGGACACCCGGCTAGCGCTCGTCGGACCAAATGGAGCAGGAAAGTCTACGTTGTTGAAACTTCTATATGGAGATTTGGTGCCGTCGACAGGTATGATCCGTAAGAATTCTCATTTGCGCATCGGTCGTTACCACCAGCACCTTCACGAGCTGTTAGATCTAGACTTGTCCCCGTTGGAGTATATGATGAAGGAGTTCCCAGAAGTACGAGAAAGGGAGGAAATGAGAAAAATCATTGGAAGATATGGTCTTACTGGAAGACAACAGGTGTGTCCAATGCGTCAGTTGTCTGATGGCCAACGTTGTCGCGTTGTATTCGCCTGGCTGGCGTGGCAGACACCTCACCTGCTGTTGATGGACGAACCGACCAATCACTTGGATATGGAGACCATAGACGCCCTCGCTGACGCCATTAACGACTTCGACGGCGGTATGGTGCTTGTCAGCCACGACTTCAGACTTATCAACCAGGTCGCCGAAGAAATTTGGATCTGCGAGAATGGTACAGTAACCAAATGGCAAGGAGGTATCCTGAAATATAAAGATCATTTGAAATCTAAGATCTTGAAAGATAACGCCGATAATGCGGCGAAATTCAACCGCAAATAG

Protein sequence:

>DPOGS201401-PA
MPSDAKKRAQQKKKEQATNRNKKPVARVTTAETNGSTNGAVKEDRELTAEEALCLKLEEEAKMNSEARSCTGSLAVHIRSRDIKIANFSITFYGSELLQDTLLELNCGRRYGLVGLNGCGKSSLLAVLGRREVPIPDHIDIFHLTREMPASDKTALQCVMEVDEERIKLERLAEELAQCDDDESQEQLLDVYDRLDDLSADTAEARAAHILHGLGFSKEMQQKATKDFSGGWRMRIALARALYVKPHLLLLDEPTNHLDLDACVWLEEELKQYKRILVLISHSQDFLNGVCTNIIHMSKRRLKYYTGNYEAFVRTRMELLENQMKQYNWEQDQIAHMKNYIARFGHGSAKLARQAQSKEKTLAKMVAQGLTEKVTDDKILNFYFPSCGKVPPPVIMVQNVSFRYTDSGPWIYKNLEFGIDLDTRLALVGPNGAGKSTLLKLLYGDLVPSTGMIRKNSHLRIGRYHQHLHELLDLDLSPLEYMMKEFPEVREREEMRKIIGRYGLTGRQQVCPMRQLSDGQRCRVVFAWLAWQTPHLLLMDEPTNHLDMETIDALADAINDFDGGMVLVSHDFRLINQVAEEIWICENGTVTKWQGGILKYKDHLKSKILKDNADNAAKFNRK-