Monarch geneset OGS2.0

DPOGS203261
TranscriptDPOGS203261-TA2241 bp
ProteinDPOGS203261-PA746 aa
Genomic positionDPSCF300229 - 54306-60878
RNAseq coverage596x (Rank: top 21%)
Annotation
HeliconiusHMEL0153600.095.83% 
BombyxBGIBMGA000447-TA0.093.81% 
Drosophilacomt-PA0.073.95% 
EBI UniRef50UniRef50_P464610.073.95%Vesicle-fusing ATPase 1 n=17 Tax=Opisthokonta RepID=NSF1_DROME
NCBI RefSeqXP_001120201.10.077.39%PREDICTED: similar to Vesicular-fusion protein Nsf1 (N-ethylmaleimide-sensitive fusion protein 1) (NEM-sensitive fusion protein 1) (dNsf-1) (Protein comatose) isoform 2 [Apis mellifera]
NCBI nr blastpgi|65808080.095.44%N-ethylmaleimide sensitive fusion protein [Manduca sexta]
NCBI nr blastxgi|65808080.095.44%N-ethylmaleimide sensitive fusion protein [Manduca sexta]
Group
Gene OntologyGO:00055243.6e-39ATP binding
GO:00054884.4e-32binding
GO:00001661.6e-17nucleotide binding
GO:00171111.6e-17nucleoside-triphosphatase activity
KEGG pathwayame:7256800.0 
 K06027 (NSF)maps-> Vasopressin-regulated water reabsorption
InterPro domain[256-396] IPR0039593.6e-39ATPase, AAA-type, core
[4-84] IPR0090104.4e-32Aspartate decarboxylase-like fold
[252-399] IPR0035931.6e-17ATPase, AAA+ type, core
[5-85] IPR0033389.7e-16ATPase, AAA-type, VAT, N-terminal
[111-157] IPR0042019.3e-11Cell division protein 48, Cdc48, domain 2
Orthology groupMCL11186 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203261-TA
ATGTCTTCAATGCGCATGAAGGCAGCCAAATGTCCATCAGACGAACTGGCTATCACCAACTGTGCTCTAGTCAACCCGGACGATTTCCACAGTGATGTTAAGCATATTGAGATATCAACAGCTCCATCTCAACACTTTGTATTCAGTATAAGATTTTACAGTGGAGTTGATAGAGGCACCGTGGGATTTTCTGCTCCACAAAGAAAATGGGCTACTCTGTCTATTGGACAGACTATCGAAGTCAAACCATTCAAGCCACAGAGTGCGGAATGTTTGTGCAGTGTCACTCTGGAGGCTGATTTTATGCTAAAGAAAACAACCTCCATGGATCCCTATGATTCAGAGCAGATGGCCAGAGACTTCCTCATCCAGTTCTCAAACCAAGTGTTCACCGTTGGACAGCAACTTGCTTTCTCATTCCAAGAGAAAAAAGTACTCTCTTTGATTGTCAAGAACTTGGAAGCGGTGGACGTGCAAGCGCTGGCCGCTGGTTCGAACGCTGTACCTCGTCGGGTCCGTATGGGTCGTCTGCTACCAGACGCCTGTATACAGTTCGACAAGGCCGAGAACTCCTCGCTCAACCTCGTGGGAAAAGCTAAGGGGAAGCAGCCTCGTCAGTCGATCATAAATCCTGATTGGGACTTCGGTAAGATGGGCATCGGCGGTCTGGACAAGGAGTTCAACGCTATCTTCAGGCGAGCGTTCGCGTCACGAGTGTTCCCACCGGAAGTGGTCGAACAATTAGGCTGCAAACACGTGAAAGGCATCCTGCTATACGGCCCGCCCGGTACCGGTAAAACTTTGATGGCCCGACAGATCGGTAAGATGTTGAACGCGAGGGAGCCTAAGATCGTTAATGGTCCCCAAATATTGGACAAATACGTCGGCGAGAGTGAGGCCAACATTCGCAGGCTGTTTGCTGACGCCGAGGAAGAAGAGAAGAGGTGTGGTCCGAACAGCGGCCTGCACATTATCATCTTCGACGAAATCGACGCTATTTGTAAGGCGAGAGGCTCAGTGGGCGGCAACACGGGCGTCCATGACACCGTCGTCAACCAGCTGCTCTCCAAAATAGATGGTGTGGACCAATTGAACAATATTTTGGTCATTGGTATGACTAACAGGAGGGACATGATAGATGAAGCGCTCATGAGACCCGGGCGACTTGAAGTACAGATGGAGATAGGTTTGCCAGACGAGAAAGGAAGGGTGCAGATATTGAACATCCACACCAAGCGGATGAAAGAGTACAAGAAGATCTCCGAGGACGTCGATAATAAGGAGTTGGCAGCCCTGACGAAGAACTTCTCCGGAGCTGAACTTGAAGGTTTGGTTAGGGCTGCGCAGTCCACGGCCATGAACAGACTCATAAAGGCATCCAGTAAAGTGGAAGTAGATCCTGAAGCCATGGAAAAACTCATGGTGGAGAGAGGAGATTTCCTACATGCCTTGGAAAATGATATTAAGCCGGCATTTGGTACAGCTGCCGAAGCCCTGGAACACTTCCTGGCTCGAGGCGTCATCAACTGGGGTCTCCCTGTGTCTTCGTTGTTGGAGGACGGACAACTTTATATACAGCAGTCTAGAGCCACTGAAGCCAGCGGCCTAGTATCCGTGCTGTTAGAAGGTCCTCCAAACAGCGGTAAGACGGCCCTAGCAGCTCAGTTGGCCAAGATGTCTGACTTCCCGTTCGTGAAGGTTTGCTCTCCGGAAGACATGGTCGGCTTCACTGAGACAGCCAAGTGCTTGCAGATAAGAAAGTACTTCGACGACGCATACCGCTCCAGCCTGTCCTGTATATTGGTGGACAATATCGAGAGGCTGTTGGACTACGGTCCTATAGGGCCGCGGTATTCCAACCTCACGCTGCAAGCCCTGTTGGTGCTTCTCAAGAAACAACCTCCCAAAGGACGTAAACTGCTCATACTGTGTACCAGCAGTCGCAGACAAGTCCTCGAAGACATGGAAGTTCTATCAGCGTTCACGGGTGTACTCCACGTTCCTAACCTGTCTCAACCTGAGCACGTGATGACAGTGCTTGAAGAAAGCGACGCCTTCACTAAACGCGATCTGGCCAAAATACAGCACGACCTGAGAGGGGCCAAAATTTTCATCGGGATCAAAAAGCTGTTGGCGTTGATTGACATGGTGAAGCAAACGGACGAAGAGTCCAGGGTGTTCAAGTTCCTGACGAAGATGCAGGAGGAGGGCAGCCTTGACCTGGGCACTACCATACAATAA

Protein sequence:

>DPOGS203261-PA
MSSMRMKAAKCPSDELAITNCALVNPDDFHSDVKHIEISTAPSQHFVFSIRFYSGVDRGTVGFSAPQRKWATLSIGQTIEVKPFKPQSAECLCSVTLEADFMLKKTTSMDPYDSEQMARDFLIQFSNQVFTVGQQLAFSFQEKKVLSLIVKNLEAVDVQALAAGSNAVPRRVRMGRLLPDACIQFDKAENSSLNLVGKAKGKQPRQSIINPDWDFGKMGIGGLDKEFNAIFRRAFASRVFPPEVVEQLGCKHVKGILLYGPPGTGKTLMARQIGKMLNAREPKIVNGPQILDKYVGESEANIRRLFADAEEEEKRCGPNSGLHIIIFDEIDAICKARGSVGGNTGVHDTVVNQLLSKIDGVDQLNNILVIGMTNRRDMIDEALMRPGRLEVQMEIGLPDEKGRVQILNIHTKRMKEYKKISEDVDNKELAALTKNFSGAELEGLVRAAQSTAMNRLIKASSKVEVDPEAMEKLMVERGDFLHALENDIKPAFGTAAEALEHFLARGVINWGLPVSSLLEDGQLYIQQSRATEASGLVSVLLEGPPNSGKTALAAQLAKMSDFPFVKVCSPEDMVGFTETAKCLQIRKYFDDAYRSSLSCILVDNIERLLDYGPIGPRYSNLTLQALLVLLKKQPPKGRKLLILCTSSRRQVLEDMEVLSAFTGVLHVPNLSQPEHVMTVLEESDAFTKRDLAKIQHDLRGAKIFIGIKKLLALIDMVKQTDEESRVFKFLTKMQEEGSLDLGTTIQ-