Monarch geneset OGS2.0

DPOGS203839
TranscriptDPOGS203839-TA1827 bp
ProteinDPOGS203839-PA608 aa
Genomic positionDPSCF300010 + 2668041-2675912
RNAseq coverage2127x (Rank: top 6%)
Annotation
HeliconiusHMEL0069452e-16889.63% 
BombyxBGIBMGA003744-TA0.080.47% 
Drosophilapnut-PB7e-16861.01% 
EBI UniRef50UniRef50_Q7QJN65e-18072.06%AGAP007596-PA n=16 Tax=Endopterygota RepID=Q7QJN6_ANOGA
NCBI RefSeqXP_001600879.10.073.13%PREDICTED: similar to septin [Nasonia vitripennis]
NCBI nr blastpgi|3838543560.077.28%PREDICTED: protein peanut-like [Megachile rotundata]
NCBI nr blastxgi|3800259130.064.44%PREDICTED: protein peanut-like [Apis florea]
Group
Gene OntologyGO:00055259.3e-248GTP binding
GO:00070499.3e-248cell cycle
KEGG pathwaytgu:1002208743e-100 
 K04557 (PNUTL1, CDCREL1)maps-> Parkinson's disease
InterPro domain[158-587] IPR0000389.3e-248Cell division protein GTP binding
Orthology groupMCL13433 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203839-TA
ATGCGACGGATATCCATTTTGAATTTAGTCCGAGAATACAGTTGTCGGCTTAGCGAAAACTCCCTCGCGTGCACCGCGTTGCTAGCTCTTAGAAAAGATATGTTACAGAGTAAGCGTGAGCTGTTCTTCAAATCGGATACGGTGAGCGGCGGCGGGGTGAGCGCTACAGGGTTGTCGACTGCAGCGCCACCCGCTCCGCCGCTGGGTTCCTCAGCCCTCAAGGAAGCGCTCGCGAAGAGATCCGCAACCCTGCCAGATCCGCCAGAGAACAGTCATCATGAGGCTAACGGTACAGCCCCGGCTTCAAAACTCAGCAATAGCGTGTCCGTGGATACCGGCAAGATAAATGCGATGCCACCACAGATGCCAGCACCACCCGTGCCTACCACCGCCCCACCACCAGTCTCCGCACCACTACGGAGTACGGAGAATATTATGAAGAAATCTGACCACCCACCGGTCGCTCCAAAACCAGATCTACCTAAGATAGAGAAACCAAAGACAAAAGAATTAGATGGATATGTGGGGTTTGCGAATTTACCCAACCAAGTCTATAGGAAAGCTGTGAAAAAGGGTTTCGAATTCACGTTAATGGTTGTAGGCGAGACGGGATTGGGCAAATCAACGCTTATCAACTCATTATTCCTAACGGAAGTTTACGACAAGGACAAGCATCCGGGTCCATCGCTACGAGTCAAGAAGACTGTCGGCGTCGAGACCAGTGTGGTCCTGTTGAAGGAGAACGGGGTCAACCTGACCCTTACCATCGTCGACACACCCGGCTTCGGAGACGCGGTCGACAACAGTAATTGCTGGCAACCAATAATCGATTTTGTTGAATCGAAGTACGAGGAGTTTCTGAACGCGGAATCCCGTGTTACCCGAAAGGCTGCCCCGGCTGACACGCGAGTGCATTGCTGCCTTTACTTCATAGCTCCCAGCGGACACGGCCTTAAGCCGCTGGACGTAGAGTTCATGCAACGGCTCGGGGATAAAGTCAACATCATACCAGTCATCGCCAAAGCTGATACTATGACACCTGAAGAGTGCAAGGACTTCAAAGAACAGATATTAAAGGAGATAGCGCAGCACAAGATAAAGATATACGAGTTCCCGGAGAGCACGGGCGAGGAGGGCGAGGGGGCGGACACCAACCGCGCGCTGCGAGCACGGGTGCCCTTCGCGGTGGTGGGGGCTAACACTGTCATAGAACAGGACGGACGGCGAATAAGGGGCCGGAAGTACCCCTGGGGTATAGCGGAAGTGGAGAATTTGGAGCATTGTGATTTCCTGGCGCTCCGCAATATGGTGATTCGTACACATCTGCAAGACCTCAAAGACGTCACCAGCTCAGTCCACTACGAGAACTACCGATGCCGCAAACTGGCCGGTCTAACACACGACGGCCAGCCTCACGGTCTCAACTCCAACAATTTCTGCCCGCAAGGACTGATGAACAGTTTCATGACCGTGTGGAATCCATTAGCTCAGATGGAAGAAGAGAAAAGAGAGCATGATCTCAAAATGAAGAAGATGGAATGTGAAATGGAACAGGTGTTCGATCAAAAGGCCCGCGAGAAGCACGCCAAGTTGAAGGAGTCCGAGGCGGAACTGGCTCGGCGGCACGAGGCCACGCGGCGTGCGCTCGAAGCCCAGGCCCGGGACCTGGAGGAGAGACAGCGCGCGCTGAGGGCCGAGCAGGCCGCCTGGGAGAGAGACACCGGACTCTCGCTCGATGATCTACGTAGGAGGTCGCTTGAGGCCAACAGCAAGGAGACGGTCGATGGGAAGGATAAGAAAGACAAGAAAAAGAAAGGTTTGTTTTAA

Protein sequence:

>DPOGS203839-PA
MRRISILNLVREYSCRLSENSLACTALLALRKDMLQSKRELFFKSDTVSGGGVSATGLSTAAPPAPPLGSSALKEALAKRSATLPDPPENSHHEANGTAPASKLSNSVSVDTGKINAMPPQMPAPPVPTTAPPPVSAPLRSTENIMKKSDHPPVAPKPDLPKIEKPKTKELDGYVGFANLPNQVYRKAVKKGFEFTLMVVGETGLGKSTLINSLFLTEVYDKDKHPGPSLRVKKTVGVETSVVLLKENGVNLTLTIVDTPGFGDAVDNSNCWQPIIDFVESKYEEFLNAESRVTRKAAPADTRVHCCLYFIAPSGHGLKPLDVEFMQRLGDKVNIIPVIAKADTMTPEECKDFKEQILKEIAQHKIKIYEFPESTGEEGEGADTNRALRARVPFAVVGANTVIEQDGRRIRGRKYPWGIAEVENLEHCDFLALRNMVIRTHLQDLKDVTSSVHYENYRCRKLAGLTHDGQPHGLNSNNFCPQGLMNSFMTVWNPLAQMEEEKREHDLKMKKMECEMEQVFDQKAREKHAKLKESEAELARRHEATRRALEAQARDLEERQRALRAEQAAWERDTGLSLDDLRRRSLEANSKETVDGKDKKDKKKKGLF-