Monarch geneset OGS2.0

DPOGS206600
TranscriptDPOGS206600-TA4512 bp
ProteinDPOGS206600-PA1503 aa
Genomic positionDPSCF300048 - 1335185-1377514
RNAseq coverage13x (Rank: top 83%)
Annotation
HeliconiusHMEL0088300.084.54% 
BombyxBGIBMGA008326-TA2e-9296.43% 
DrosophilaCG42629-PB0.086.28% 
EBI UniRef50UniRef50_UPI00020626A20.060.60%UPI00020626A2 related cluster n=2 Tax=unknown RepID=UPI00020626A2
NCBI RefSeqXP_550681.30.086.88%AGAP002115-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3454794010.076.83%PREDICTED: hypothetical protein LOC100122885 [Nasonia vitripennis]
NCBI nr blastxgi|3838582450.068.61%PREDICTED: uncharacterized protein LOC100880942 [Megachile rotundata]
Group
Gene OntologyGO:00056222.6e-59intracellular
GO:00510562.6e-59regulation of small GTPase mediated signal transduction
GO:00050962.6e-59GTPase activator activity
GO:00055154.5e-06protein binding
GO:00082707.5e-05zinc ion binding
KEGG pathwaymcc:7159581e-60 
 K08013 (SIPA1)maps-> Leukocyte transendothelial migration
InterPro domain[844-1033] IPR0003312.6e-59Rap/ran-GAP
[1263-1328] IPR0113331.5e-15BTB/POZ fold
[1103-1328] IPR0002104.5e-06BTB/POZ-like
[1267-1326] IPR0130697.7e-06BTB/POZ
Orthology groupMCL13116 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206600-TA
ATGACGGCGATATGCTTCGTCTGCAATTTACCGATACTATCACACCAAGTTGGTTTGGTGTGGGCAGGAGGGAATGGATGGGACGAACTCACAAGATCCAATTTGGAAGAATCTACCCTTAGACAGCGGCTTGGGAACCGCAGGGATTCTGCAGCCCAAGTGTGTTGCTTATCACCACCAGAGCCCGATGTCTCGGATCAACCCAGGGGTGGTCCCCGCCGTCGTCGTTCGTCGCTTGCTCAACTTACCGACATACTTCGCGAATGGGGTGGTGGGAGCAGCGCACCCAGACGTCAAAGAGCTCCCTTATCCAGACGAGAGACATTGGCAGACTTAGCTCGCTCCTTACCTTGGGCCCGACATGAACCACCCCCTCGTCGTCGACGCGAATCGTCAGCGGATTCTGGAATTAAATCCTCAGTGTCTAAACGCCGGGATTCCAGAGCTGTCCCTCATGATTTTAAAACAGATTTTCCCAACAGCTTAGATCGCAAAGATTCTATAGCTGTAATGACAAACAGAGAACGCAGGGTTAAACGAAGAGGATCTGGAGATGGTAAAGATCGGCGTGATTCGGTCGATGGTCACCGACATCGACGTGATTCCTTAGCGACTCCCCCACCGCGTATAGTCGCCACTCACAAAAAGAGACGAGGTTCTCCACCGACACCGGGTCCGCCATCTTTCGTAGATGCATCATCCGATCCTGGTCCATCAACTCATCCGCCCTCTGTGACAGTAGTAACAGTGCACGAGCCAAGTCGCTCAGATGGAGAATGCCCACCAATGACTATGCCAACAATTATCACATCAGCAGTAACACCTTCGCCAACTTCACCAACTGTACCCACATCCAATGCAACACAGACTACTCAAGCCACTCAGGCTACCCAGTCTGCACAGGGTACTCAAGTTACTCCTGGTGGAAAAACACCTCCACAATTAGGTGGCAGGCGAGATTCTACCACCCAATGTGGTCGAGCTAGACGAGATTCACGAGCTACGGCTAGTCCAGAACGAAGATTAGGCAGATTACAAAGACAAGCAACTGCTTTTGATGACCCCACTGGTCCACCTGGAACCCGTCGCAGAGATTCCGGACCTACACTGGAACCGGATGATGCAGGCCGAGCCAGACGTGACTCGTTAAGTCCTGACTCAGCACGGCCAAAAAGGGAAAGAAACCAGTTGAGTCCTGATCGCGCAGGGGGTGGTGAACTTAGCCCATCAGCCGCTCGTCGAAGAAGTCGCCTAAGACGCCAAGCTTCATGTGCCAGAGTTGGTCGAGCAAGAAGTCCTGAGTCGAGCTCGTGTTCAAGCCGAGATCCAAGCCCTTGTGCACGACCTCCTGAGAGAACGATGTTTAGGAGACAATCGACCACGGAAGAGATATTGATAGCACGTGGTTTTAGACGTCAATCTACAACAGAAGAAATGATACGATGTCGTAATTTTCGAAGACAAAGCTCCCAGAGTGATGACGCTTGTATGCGTGCTCGTGGTCGCCGAGATTCTTCAACACAAATACTCGATGGCACTATTGGTACCATGACGGTAGAAACTACGAGTACATTCTTTGACTCCAGTACACAAACTGAGCCATCCCCGTTGTACGACAACAACCATTATCACGAGGAATGCCTTCGTTGCAACTCCTGCGGGTTGAATCTCACCGGACCAAATCAAAAACGTGCTCGCCGTTTCAAGAATCAGATTCTCTGCGATTTGCACTTCGCGGACGTGGCGCTGATGGAGTGCAGTGACTTCATGCAACAGCTTCGCAGCTTCAAACCCCAGTCGCTTGGATGTGCTGTCGCCAGGCGAAAGTCGTCGACTACCCTCATATTCCCTTTGCCTCCTCAAGCTTGCTCAGATGAGTTCTGCGAAGAATACCCTCACAATCTGATTCCAACACCGGGATACTGGATCGAGTGCTCGCGTCAGAAGATTACCACAGACACGATTTGGGATGAATCAGAATCTGAACACGACAGCGGTCCAGATAGAGATGATTCAGATCGACGTAGGAGATCGGGTTCCTTGGACGAAGCCGCAGAGGATAGCAGCGATAACGGTGGAAGTACGCCGAAGAAGAAAACAGCTATCGAAGAACAATGGGAAAGATCTGGTGGCTTCGAACTTACATCAGTTGAACAGGAAACTTACGAAAAATACTTCTACGGCACTGAACATTGGAACTATTTCACCAATGACGAAGACTTGGGCCCCGTTATACTTTCAATAAAGCAAGAAACTCTTAATGCAAGAGACCAATTCAGAATTCTCGTCCGAGCGATCAGCTATACTGTACATGGCCTAATACCCGCATCCTGTGTCTTCGCTGACCGCTACAATCGCGAAGAAGTGGTCAGATCTCTAGGCAAAGAGGTCAATATCAACCCACCTCTCATGCTGGGACAACTGCCCGATACCCCTGAGGAACTACTAAAATTGGACCAAGTTTTTATAAAATCAGAACTGAAAGTTGGTGTGATATATGTGAAGGAAAATCAATACACCGAAGAAGAAATTTTGGACAATAATGAGAACTCACCACTTTTTGAAGAGTTCTTACAAGTTCTCGGCGAAAAAGTTCGTCTTAAAGGATTCGACAAATACAAAGGCGGGCTTGATACCGTTCACGATCTCACAGGATTGTATTCAGTTTACACAAACTGGAGGAGCATTGAGATCATGTTTCACGTTTCAACTCTTCTGCCGTATGAGAAACATGACGCACAGAAATTACAACGAAAACGTCACATCGGAAATGACATTGTATGTGTCGTATTCTTAGAAGCCGATAATACAGCCTTTTCACCAGCCTGCATAAAGAGTCATTTCTTACACACATTTATTTTAGTGCGAGTGTCTGCTAAAATTAAAAGGCGTCCCACTAGATATGAAGTGTCGGTCGTGACCCGAGACGAGGTGGGAGCTTACAAGCCCTACTTGTGGGAGCAAAGTGTGTTTGATAAAGGACCTATGTTCAGAGAATGGTTACTTACTAAAATTGTAAATGGTGAGCGAGCTTCGTATTCAGCACCTAAGTTTGCTCGAATGCAGGAACGTACTCGAAGCCAAATGTTAGAAGACATAGTCGCCAATTTGCAGAATCACGCAGAAACTGGACAGATCCCTAAGCCTTACCGACGAGGATCTTGGCGTCCAATTGGTCACATGCGACCGTCATCGCCATTGTTAGACTCCGTTAGGGATCAGTTCGAGGACTACGACCAACTGGCCAAAGATTTTACAAGAGTTTTCCTCAACAGTGAACTAAATGCTGCTCAAAATGCACAACTTTTCGATGTAGTATTCATGGTTGGGCAATCTAAACAGAAAACGAAATTTATCGGTGTCCGTGCAATACTAGGTGTGAGGAGCAGGGTATTCCAAGAAATGCTGTACGGCATACAAACTGGTTTCGGCTCCCCTCAAGTGCCGGTAGCTGAACTGTTGGCTCGACCCGCACCCACGCTTCTGTCTCCCACGCCACGACAAAAAAGTAGCAACTTCTTACAAGTACCTGACATTGAATCTCCAAGACCCAAAAGCGTTCCCAGTTCTCCTATGGTTAAACGTGCTTTCTCCCGCCTCGGCACCATAACAGCTGGATGGGGTCGATCTATCAGGAAACAACATTCTCAACTCAATGTTGATGATAAGAAAAAATGGGCTAGCTCACAAGACTGTTCAAATAAAGAAAGTAAGGACAAAGACAAAGAAAAAAATGCAGCTTTGGCAGTTCCACGTCTCTCAGTGTGTGCTGATGCTCAAAAAGTAGACCGAGCAAAGCTTGCTCAAACTGAGTTCTCGATAATCGAATTCGATCCTGAGACCTTTCGGATCTTGCTGGACTACTTACACACGGGTAGCTGTCCTCTCACTTGCGCCTCCATACCGGGACTCATCTGTGCTGCGGAACACTACGACCTGCCTGAACTTCTGCAGGCTTGCTTCCATCACGCGAAGCAGTTCCTTAGGATTGAAGTGGTCTGTACCATGCTGATCTCGTTGGAAAATTACTACTGGCGTTACACATCAGCTTCTGAGCTGGTCAATATGATTCTGGCATTTATAGAACAACGGGCATATGCTCTTTTCCAAACTTCGGAGTTTCTTAACCTATCCGAATCGATGGTACAGATGATAATGTGCAGGAACCTGGAAGTACCAGAAGTTAGAAAATTCGAAGCAATGCTTAGTTGGGCTCGCAATAAAATCAAATCAAGGTCTACGAACAAGACTGACGCCAAGAATGAGTTCAAATGTATAATGGAGCGATTAGCTAGAGATTTGAAATTATACAGGATCTCACCCCAAGAATTGATAAAAGTGGTTCTTCCCTCAAAAGCAATCAAAAATGAACGTATCCTCGAAACGTTGATGTATCAAGCCAACTCGGGAATGTACAGAATTCAGGACAGCTACATCGAGGCTTGTCAGCAACGCCTACAGAAACAGGATTCGAGATTCTCCGAATTTGAAAGTTTTGATTACGGTATATAA

Protein sequence:

>DPOGS206600-PA
MTAICFVCNLPILSHQVGLVWAGGNGWDELTRSNLEESTLRQRLGNRRDSAAQVCCLSPPEPDVSDQPRGGPRRRRSSLAQLTDILREWGGGSSAPRRQRAPLSRRETLADLARSLPWARHEPPPRRRRESSADSGIKSSVSKRRDSRAVPHDFKTDFPNSLDRKDSIAVMTNRERRVKRRGSGDGKDRRDSVDGHRHRRDSLATPPPRIVATHKKRRGSPPTPGPPSFVDASSDPGPSTHPPSVTVVTVHEPSRSDGECPPMTMPTIITSAVTPSPTSPTVPTSNATQTTQATQATQSAQGTQVTPGGKTPPQLGGRRDSTTQCGRARRDSRATASPERRLGRLQRQATAFDDPTGPPGTRRRDSGPTLEPDDAGRARRDSLSPDSARPKRERNQLSPDRAGGGELSPSAARRRSRLRRQASCARVGRARSPESSSCSSRDPSPCARPPERTMFRRQSTTEEILIARGFRRQSTTEEMIRCRNFRRQSSQSDDACMRARGRRDSSTQILDGTIGTMTVETTSTFFDSSTQTEPSPLYDNNHYHEECLRCNSCGLNLTGPNQKRARRFKNQILCDLHFADVALMECSDFMQQLRSFKPQSLGCAVARRKSSTTLIFPLPPQACSDEFCEEYPHNLIPTPGYWIECSRQKITTDTIWDESESEHDSGPDRDDSDRRRRSGSLDEAAEDSSDNGGSTPKKKTAIEEQWERSGGFELTSVEQETYEKYFYGTEHWNYFTNDEDLGPVILSIKQETLNARDQFRILVRAISYTVHGLIPASCVFADRYNREEVVRSLGKEVNINPPLMLGQLPDTPEELLKLDQVFIKSELKVGVIYVKENQYTEEEILDNNENSPLFEEFLQVLGEKVRLKGFDKYKGGLDTVHDLTGLYSVYTNWRSIEIMFHVSTLLPYEKHDAQKLQRKRHIGNDIVCVVFLEADNTAFSPACIKSHFLHTFILVRVSAKIKRRPTRYEVSVVTRDEVGAYKPYLWEQSVFDKGPMFREWLLTKIVNGERASYSAPKFARMQERTRSQMLEDIVANLQNHAETGQIPKPYRRGSWRPIGHMRPSSPLLDSVRDQFEDYDQLAKDFTRVFLNSELNAAQNAQLFDVVFMVGQSKQKTKFIGVRAILGVRSRVFQEMLYGIQTGFGSPQVPVAELLARPAPTLLSPTPRQKSSNFLQVPDIESPRPKSVPSSPMVKRAFSRLGTITAGWGRSIRKQHSQLNVDDKKKWASSQDCSNKESKDKDKEKNAALAVPRLSVCADAQKVDRAKLAQTEFSIIEFDPETFRILLDYLHTGSCPLTCASIPGLICAAEHYDLPELLQACFHHAKQFLRIEVVCTMLISLENYYWRYTSASELVNMILAFIEQRAYALFQTSEFLNLSESMVQMIMCRNLEVPEVRKFEAMLSWARNKIKSRSTNKTDAKNEFKCIMERLARDLKLYRISPQELIKVVLPSKAIKNERILETLMYQANSGMYRIQDSYIEACQQRLQKQDSRFSEFESFDYGI-