Monarch geneset OGS2.0

DPOGS209040
TranscriptDPOGS209040-TA1353 bp
ProteinDPOGS209040-PA450 aa
Genomic positionDPSCF300102 - 57812-61236
RNAseq coverage661x (Rank: top 19%)
Annotation
HeliconiusHMEL0060880.095.08% 
BombyxBGIBMGA009992-TA0.095.34% 
DrosophilaPast1-PA0.084.60% 
EBI UniRef50UniRef50_Q299V40.084.22%GA19392 n=2 Tax=pseudoobscura subgroup RepID=Q299V4_DROPS
NCBI RefSeqXP_001654538.10.086.65%past-1 [Aedes aegypti]
NCBI nr blastpgi|3071676850.085.98%EH domain-containing protein 1 [Camponotus floridanus]
NCBI nr blastxgi|1571261050.086.65%past-1 [Aedes aegypti]
Group
Gene OntologyGO:00055251.5e-22GTP binding
GO:00039241.5e-22GTPase activity
KEGG pathwayaag:AaeL_AAEL0104030.0 
 K12483 (EHD1)maps-> Endocytosis
InterPro domain[58-218] IPR0014011.5e-22Dynamin, GTPase domain
Orthology groupMCL10456 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209040-TA
ATGTTTAGCTGGCTTAAAAAAGAAGGTGAAAAAACTGAGAGCATCGAAAATGTCGTCGAGGGTCTCAAAAGAATATACAAAACGAAACTTTTGCCGCTGGAATCACATTACCAGTTTCATGATTTCCATTCCCCACAACTTGAAGACCCTGACTTTGATGCAAAACCCATGATACTCCTGGTCGGACAATACTCTACCGGCAAAACGACCTTTATCAAATATCTCTTAGAGAGAGATTTTCCTGGAATACGTATCGGACCGGAGCCCACAACAGATAGATTTATTGCTGTTATGTTTGATGAGAAAGAAGGAATGATTCCCGGGAATGCACTCGTTGTGGATCCCAAGAAACAATTTCGACCTCTCAGTAAATTTGGAAACGCCTTCTTAAACAGATTCCAATGTTCAACGGTAAACTCCCCTGTATTGAGAGGTATTTCAATCGTCGACACACCCGGTATTCTATCGGGGGAGAAACAGCGTGTCGACAGAGGTTACGACTTCACGGGTGTTCTGGAATGGTTCGCGGAGCGTGTCGACCGTATCGTACTCCTTTTCGACGCCCACAAACTCGATATTTCAGACGAATTCAGACGGAGTATCGAAGCTCTGAGGGGACACGATGATAAAATACGGATTGTTCTTAACAAGGCAGACATGATCGATCATCAACAGCTCATGCGAGTGTATGGAGCTCTGATGTGGTCGCTGGGCAAGGTGCTGCAGACACCTGAGGTGGCCCGTGTGTACATAGGATCATTCTGGGACCAACCTCTCCGCTACGACGTCAATAGACGGCTTTTTGAGGACGAGGAGCAGGATCTCTTCCGAGACATGCAGTCGCTCCCTAGGAATGCAGCATTACGTAAATTGAATGACTTAATTAAAAGGGCACGGCTCGCTAAAGTGCATGCTTATATTGTTAGTGAGTTGAGAAAAGAAATGCCTTCAATGTTCGGAAAGGATGGTAAGAAGAAGGAATTAATAAAAAATCTTGGTCAAGTGTATGATAGAATACAAAAAGAAATGCAAATATCTCCTGGTGATTTCCCTGATATCAAGAAAATGCAAGAAACTCTGGCAAACCACGACTTCACCAAGTTCCATCCTCTCAAACCCAAGTTGCTAGAGGTAGTGGACCACATGCTGGCCACCGACATCGCAAGACTCATGGACATGATCCCGCAGGAAGATGTCAATGTTGTCTCCGAACCTCTCATAAGGGGCGAGGCTGACAGCCCGGTGTCGGAGGTGGTCCGGCGCCTCCATGTCCGCCGCGGCTTCCTCGCAGGTAAGCTGCGTCGGGTCTCGGAGCACTCGTCCTGTGTCTCATGTCCTCCACGTGCTAAATAA

Protein sequence:

>DPOGS209040-PA
MFSWLKKEGEKTESIENVVEGLKRIYKTKLLPLESHYQFHDFHSPQLEDPDFDAKPMILLVGQYSTGKTTFIKYLLERDFPGIRIGPEPTTDRFIAVMFDEKEGMIPGNALVVDPKKQFRPLSKFGNAFLNRFQCSTVNSPVLRGISIVDTPGILSGEKQRVDRGYDFTGVLEWFAERVDRIVLLFDAHKLDISDEFRRSIEALRGHDDKIRIVLNKADMIDHQQLMRVYGALMWSLGKVLQTPEVARVYIGSFWDQPLRYDVNRRLFEDEEQDLFRDMQSLPRNAALRKLNDLIKRARLAKVHAYIVSELRKEMPSMFGKDGKKKELIKNLGQVYDRIQKEMQISPGDFPDIKKMQETLANHDFTKFHPLKPKLLEVVDHMLATDIARLMDMIPQEDVNVVSEPLIRGEADSPVSEVVRRLHVRRGFLAGKLRRVSEHSSCVSCPPRAK-