Monarch geneset OGS2.0

DPOGS207959
Transcript        DPOGS207959-TA    4602 bp
Protein           DPOGS207959-PA    1533 aa
Genomic position  DPSCF300090 + 82311-90102
RNAseq coverage   44x (Rank: top 72%)
Annotation
Heliconius      HMEL014326              0.0     66.45%
Bombyx          BGIBMGA000310-TA        0.0     88.94%
Drosophila      CG42795-PA              1e-131  40.76%
EBI UniRef50    UniRef50_UPI00022C9DB8  2e-149  43.21%  UPI00022C9DB8 related cluster n=5 Tax=unknown RepID=UPI00022C9DB8
NCBI RefSeq     XP_393843.2             2e-144  43.10%  PREDICTED: similar to CG3996-PA isoform 1 [Apis mellifera]
NCBI nr blastp  gi|350416505            4e-150  44.02%  PREDICTED: hypothetical protein LOC100749006 isoform 1 [Bombus impatiens]
NCBI nr blastx  gi|350416505            6e-143  40.91%  PREDICTED: hypothetical protein LOC100749006 isoform 1 [Bombus impatiens]
Group
Gene Ontology    GO:0005097  4.1e-36  Rab GTPase activator activity
                 GO:0005622  4.1e-36  intracellular
                 GO:0032313  4.1e-36  regulation of Rab GTPase activity
KEGG pathway 
InterPro domain  [227-458]  IPR000195  4.1e-36  Rab-GAP/TBC domain
Orthology group  MCL11965  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207959-TA
ATGAAAACAATTACTATAACTGAGCCTATTAATTTAAGAGTAGTTAAGTTTGAACTCGATTTTTCATCGCTATACGACTATAAACCTAAACATCGCAAGAAAAGCAAAGCAATCAATGTCCCGCATATAAACTGCGAAAAGAAACAAAACGAGATACCAAATCCACCAGAAATACAATGTAAACAAAAGAAGCAAAACGTAAAAGAATTGGTTGAAGAACTTTTAAATGATATTTATGCTGATCATCGAGATTGGACAACCATTAGCGGTCACGGATCGACTGCTGCATCTTTTACTTCCTCGCAATCAAACTGTGGGGAACAATCAGATTTCATCGAGATCAGCTATTTGGAATCTTTGGATATCAATGAACTCAAAGACCAAGTTCTGGAATGGAAAAGGTGTCTGCAAGTAGCTGGGGAACGTTTAGCCAAATGTTTACGGTTACGTGATCGTCTGCACCGTCAACAGAAAAAGCTATGTGCAGCATTCACAGTAATTCTACGTCATTTAAGACAGATGAGTGCTATTCTGAATTCTCTCAGGAACTTGGCAAGTTCTGCGAAGACTGCCTCAGCTGGCGATGCAAACATACGATTCGGGGTTACGCCAGGCAGCGAAGCCCCCGGCGAGGGTGGTTTTACGGAGTGGCTACACGCTATGCGTCTAGTTGCTCGCCTACCTGCTGGAGTACCACAGCACTTCCGCAGAAAGCTATGGTTGACGCTAGCAGAGCGCCACTTGACCGCGCGCGGGGTAGACTGGCCGGCCGCGGAGCGCGCGTGTTTTCGAGGCACCGCCCAGCCCGATGATACTGAACTCAGTGCTCAAATCCTTAAGGATCTGCACAGAACAGGCTGTTCCCTTTTTTGTGGAGTGGAAGGTCGCGAGAATCAAGCAATGCTTAGAAGAGTTTTGTTAGCTTACGCCAGGTGGAATAAGGATGTTGGTTATTGTCAAGGTTTTAATATGTTAGCAGCTATTATTTTGGAAGTTATGGATAAGTCGGAATCAGATTCTTTAAAGGTAATGATTTATTTAGTAGAGGCAGTTCTACCTGAAGGCTACTTTGCAGATGATCTTCGAGGTTTATCAGCTGATATGGCGGCTTTCAGAGATCTTTTGAGATTAAGACTACCTAGATTGGCACAACATATGGACCACTTGCAACGAATATCTGACGGTGGTGGTGTAGAGCCTCCTTTGCCAGATGTATTCACTATGCAATGGTTTCTTACATTATACGCGACATGGCTGCCTCGTGAGTCGTTGTTGAGAATTTGGGACTTAATCTTACTTGATGGCAATGAAGTTTTGTTACTGACTGCTTTGGCAATTTGGGATATGCTTCAGGACCGAATTCTATCAGCGCGGTCTGCAGACGAGTTCTACAGCTGTATGGGTGGCGGTGTGGGAGCTGTATGGGAAGCTGGAGAATCTTTGGTAGTGCGGGTTGTTGCGTTTAACTCTGTCCCAGAATTGCCAAGATTACGCAGTCTACATAGATACAGAGTAACACCGCCAGCACCTCCAAATGCACCTCCAGTATTTCACCCCCCTTTGCAATCAGCAATGCAGTCTATGACTAAGCGAGGATTAAGACTTTTCTATTCAGAAGACGAAGGAGAGTCTAGTGACGAAGGAAATAAAATGGCTTTAGCAACGGTGGCTACAAGACGAGAACAGAGACCTGGTGCTGGTGATAGGTTATCTCTCGATATTGGAGCCTTAAAACGACAATATGCTCGACTTCGTGAGAGGCAGCGACAGGCTCATGTCATTTTAGCGGCGGCCTGCGCGCGTCACGCAGCTGTTGGTGTCCCGACTTCGCCAACATCCCTTACAGTCAACAACCTTCTCCTGGGAAAAAGTGCTATTGTAAGTTCAAGAGGACGAAGACTTGGACCTCCTCCGGGTGCAGTACCACCATCTCGTCAAACTACAAATTCTAATGAAAAATTCGAACGGTACAGTAGCAGATCTTCAAATGATACTATAAG
TTGGGAAGAAGAAAAAACTCGCAAAACACTCAATAGGAGAAATTCTGTAAAATGGAAAGATATAAAAAAAGATACGGCCTTAGAATTTGCTAGAAAACCATCTATTGATTTAGATGAATCTGGAGATGGAATTATTGTCTCTGAAATAGAAGCAAATCAAATGATTGCATCGATGAGATCACGAAGTTCTAGTGAAACATCGTCATATTCTTCGCGGTCTGAATCTAGTACAGACACAAGTCTTTGTGATGAAAATGAAAAAGCTAGTGATTCTGAAGCCGATTTACCCAATGATGATAAAGAAAATAATATAAATAGCACGAAAAACAAAATAAATTTACCCACAAACTCGCCCCCTTTGTACATAGAAAAGCAAAACAATGTATTGCCTTCTTCTAAAGACACGTCAATGCTACCATCGCCTACTTCAAACCAAAAATCTAATTCAGACGATCAAAGTGTAAAGAATATAACGGATTATCTTGGCTCCGCTGCCGGGGAGCCATTACGCGTTTATATAGATGGGTTTGAAATGAAATTAAAAGACACTCAAATGGTTAACAATGACAGCAAATCCTATACAGTTGTTAGTCCACTTAAACCCGTTCATTCACCACAGATAGACGTTGAAAGCAATAATTCATTTAAACACTTACAATATGAGCATAGGAATAATAAAAAAAATACAAATTTATTAAAATCCAAAAATAAAATACACCTAAGTCCCATCGACTTGTCCCCTCATAATTGTGTAAGAAAATTTCATGCACCTACAAAAAATCACAATAAATTAGAAACAGTATCCTCAAATAAACGTGATGATGTTTCATATACATCTCCAGATATAATCGATACTTTTGCAAGCTACCCGAATTTCGTACCCGATATTGACTTGTCTGATAAATATATGTCAGACCCAGAAAGTCCCTTGAGAAGCCCTGAATCCGAAAGCCTCATTGTCAGTGAAGATGAACCTCCTATTCCATTAAAAGTAGATAAATATTTTGAACATAGCCCTGAATTTTTCGGAGCACGATTAACTGATCCTCACAGTTCAAACATACCGAGTAGACGAGCTACGAAAAGAAATTCACGTTTACCTAAAAAAACAGACTCTTTAACTCACAACAGTGGAAAAAATGTTAAAAACGATGCAGAGAGAGCTTCATCTCTGCCACAACCTCATACTTTTGTATGTAAAAAGTCTGTAAGTCCAGAAGTAAAGGAAGAGAAAAAAGAAGGCCATAAATTAAGGAAAAAGTCATTTCAACACACTGAACACGTAGAAAAAGATTTTCAAAAACCTATTATTGAAGATATTGTAATAGATAAAGTAAATGAAAGCCCTCCATGTGTCAAAGAGTATGTACTAAAATCAGGCCGAAAAAATAGCGAAAGAGCTTTACAAATTATACAAGAAAATTCACAAATATTGAGCAGAATTTTAACAAAACAAAACTCTAATGCAGCAGCTAAAGAAAAAGAAATTATAGATACTACAAAAACAAAATTAGTCGAAGACGACGTAACACCTACCAACACAAGTAAAATAATAACGGATTTTTCTTTAGTCAAAGACAGCGCTGCTAAACTTAACTTACTTAAAGAATCTACATCTGATTCTTTGATTGGTTGTTTAAGAAGACCTTCAAGAGGCGAACCAGATGATAAAAATGATTTTCTTAGAGAAAGACCTATATCAATTGATGACCCTTTTTCTTTTGAAATTCGTAATGTCTTAACATCAAACGATGAATTTGATATTAAAAATTTAAAAACACCGAGCATTGAATGCCTCGTAAATAAACCAGATTATTACGGTTGGGAAAATAAGGGGTTCTTAGCAAAATGTGATTTTCGAAGCAATCTAAATCGACAAGAAATTTCGGAGGATGAATACAAAAAGAGAACCAACATCATTTGTCATGATTCGCAGCATAGCATAAGGAATCAAAATAATTTAAAAGGAGACTGGTCCAGTTGCTCATTTTCAGAGGAGT
CGAAAATATCAGTCACACTTCCTTCATCAGAAATGAAGGACACTTCATCATTTGATCGTCGCTACACTTCGGACGATTATAACTATCCTATGACGAATTCCAGCAAATATAATGATTTTGTTATATCAAAGGCTAGTGATATATGTAGTAAAATAATAGATCGAAGTCCTAAAATTAGCGATAATGTAATAAAAACCGAAAATAGTCAATTTAGTGACTTTTCACACAAGGTTACATCATCTATTAGTTTTACATGTGAACCATCAATGGAAGACGATTCTCCAATGGGTGCAAAAAGTAATTCTAGTGATACATTGTGTCAAATTACGTCTTTAGATCAAATATCTACACCTATATCACCAAAAACTTTTCCCAATAGAGACACACCAAATTTGCCTAAAGATTCAGAATCAGACGATACTTGCAGCGCCATTACTTCTCTTCTAGAAACGGATACATTATCTTCTTTGTCTTACCCTCGCAGTCCATCGACTGGATCTTACCACCCTTTTCCGACGAGACCTGCTATTCGTCTGCCAAAAGATCTTGGAATTAAACTAGGAATGTATCCTAAAGAATCTTTTCCTTCTCCGCAGAAATGA

Protein sequence:

>DPOGS207959-PA
MKTITITEPINLRVVKFELDFSSLYDYKPKHRKKSKAINVPHINCEKKQNEIPNPPEIQCKQKKQNVKELVEELLNDIYADHRDWTTISGHGSTAASFTSSQSNCGEQSDFIEISYLESLDINELKDQVLEWKRCLQVAGERLAKCLRLRDRLHRQQKKLCAAFTVILRHLRQMSAILNSLRNLASSAKTASAGDANIRFGVTPGSEAPGEGGFTEWLHAMRLVARLPAGVPQHFRRKLWLTLAERHLTARGVDWPAAERACFRGTAQPDDTELSAQILKDLHRTGCSLFCGVEGRENQAMLRRVLLAYARWNKDVGYCQGFNMLAAIILEVMDKSESDSLKVMIYLVEAVLPEGYFADDLRGLSADMAAFRDLLRLRLPRLAQHMDHLQRISDGGGVEPPLPDVFTMQWFLTLYATWLPRESLLRIWDLILLDGNEVLLLTALAIWDMLQDRILSARSADEFYSCMGGGVGAVWEAGESLVVRVVAFNSVPELPRLRSLHRYRVTPPAPPNAPPVFHPPLQSAMQSMTKRGLRLFYSEDEGESSDEGNKMALATVATRREQRPGAGDRLSLDIGALKRQYARLRERQRQAHVILAAACARHAAVGVPTSPTSLTVNNLLLGKSAIVSSRGRRLGPPPGAVPPSRQTTNSNEKFERYSSRSSNDTISWEEEKTRKTLNRRNSVKWKDIKKDTALEFARKPSIDLDESGDGIIVSEIEANQMIASMRSRSSSETSSYSSRSESSTDTSLCDENEKASDSEADLPNDDKENNINSTKNKINLPTNSPPLYIEKQNNVLPSSKDTSMLPSPTSNQKSNSDDQSVKNITDYLGSAAGEPLRVYIDGFEMKLKDTQMVNNDSKSYTVVSPLKPVHSPQIDVESNNSFKHLQYEHRNNKKNTNLLKSKNKIHLSPIDLSPHNCVRKFHAPTKNHNKLETVSSNKRDDVSYTSPDIIDTFASYPNFVPDIDLSDKYMSDPESPLRSPESESLIVSEDEPPIPLKVDKYFEHSPEFFGARLTDPHSSNIPSRRATKRNSRLPKKTDSLTHNSGKNVKNDAERASSLPQPHTFVCKKSVSPEVKEEKKEGHKLRKKSFQHTEHVEKDFQKPIIEDIVIDKVNESPPCVKEYVLKSGRKNSERALQIIQENSQILSRILTKQNSNAAAKEKEIIDTTKTKLVEDDVTPTNTSKIITDFSLVKDSAAKLNLLKESTSDSLIGCLRRPSRGEPDDKNDFLRERPISIDDPFSFEIRNVLTSNDEFDIKNLKTPSIECLVNKPDYYGWENKGFLAKCDFRSNLNRQEISEDEYKKRTNIICHDSQHSIRNQNNLKGDWSSCSFSEESKISVTLPSSEMKDTSSFDRRYTSDDYNYPMTNSSKYNDFVISKASDICSKIIDRSPKISDNVIKTENSQFSDFSHKVTSSISFTCEPSMEDDSPMGAKSNSSDTLCQITSLDQISTPISPKTFPNRDTPNLPKDSESDDTCSAITSLLETDTLSSLSYPRSPSTGSYHPFPTRPAIRLPKDLGIKLGMYPKESFPSPQK-
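
The lengths listed in this record (4602 bp transcript, 1533 aa protein) can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the listed transcript is the coding sequence alone (no UTRs) and that it ends in one stop codon, which matches the trailing TGA in the nucleotide sequence and the "-" that closes the protein sequence. The function name cds_lengths_consistent is hypothetical and not part of any MonarchBase tool.

```python
def cds_lengths_consistent(transcript_bp: int, protein_aa: int, stop_codons: int = 1) -> bool:
    """Return True if the transcript length equals 3 nt per residue plus the stop codon(s).

    Assumes the transcript contains only the coding sequence (no UTRs).
    """
    return transcript_bp == 3 * (protein_aa + stop_codons)


# Values taken from the record above: 4602 bp transcript, 1533 aa protein.
print(cds_lengths_consistent(4602, 1533))
```

A mismatch under the same assumptions would suggest untranslated regions in the transcript, a partial gene model, or a truncated sequence.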