Monarch geneset OGS2.0

DPOGS212337
TranscriptDPOGS212337-TA4947 bp
ProteinDPOGS212337-PA1648 aa
Genomic positionDPSCF300019 - 344845-351173
RNAseq coverage502x (Rank: top 25%)
Annotation
HeliconiusHMEL0039440.088.26% 
BombyxBGIBMGA004647-TA0.092.54% 
Drosophilasec71-PA0.067.19% 
EBI UniRef50UniRef50_Q9VJW10.067.19%LD29171p n=31 Tax=cellular organisms RepID=Q9VJW1_DROME
NCBI RefSeqXP_002057461.10.066.80%GJ18143 [Drosophila virilis]
NCBI nr blastpgi|3407297490.067.30%PREDICTED: brefeldin A-inhibited guanine nucleotide-exchange protein 2-like [Bombus terrestris]
NCBI nr blastxgi|3123815470.072.89%hypothetical protein AND_06144 [Anopheles darlingi]
Group
Gene OntologyGO:00320122.1e-95regulation of ARF protein signal transduction
GO:00056222.1e-95intracellular
GO:00050862.1e-95ARF guanyl-nucleotide exchange factor activity
GO:00054882.4e-22binding
KEGG pathwayppp:PHYPADRAFT_1171454e-178 
 K13462 (MIN7)maps-> Plant-pathogen interaction
InterPro domain[621-808] IPR0009042.1e-95SEC7-like
[697-812] IPR0233946e-52SEC7-like, alpha orthogonal bundle
[78-1401] IPR0160242.4e-22Armadillo-type fold
[1098-1142] IPR0154033.3e-09Domain of unknown function DUF1981, SEC7 associated
Orthology groupMCL11429 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212337-TA
ATGCAAACCAATCCCAAAACTAAAGAAATGTTTATAGTCAGAGCTTTGGAAAAGATATTAGCAGATAAGGACATAAAAAGGTCCTATCACAGCCAGTTGAAAAAGTCATGCGAAGTAGCGCTAGAGGAAATAAAAACTGAGCTGAAAAATGGTGGTCAGCCAGAAACATCAGAGAGTCCAACATCAGGAACTTTACCACTACCCAAAAATGATGCATCAAATATCATCACTGCTGAAAAGTACTTTCTCCCATTTGAACTGGCCTGTCAAAGCAAAGCAGCCAGGATAGTTGTGACTGCACTGGACTGCCTTCAGAAATTAATTGCATATGGCCATCTCACAGGAAATATTCCAGATTCAACAACTCCGAGAAAATTGTTAATAGATAGAATAGTTGAAACGATTTGTAGTTGTTTTAATGGCCCACAAACTGATGAAGGAGTGCAACTTCAAATAATCAAAGCTCTTCTTACTGTGATCACCAGCCAGCATGTTGAAGTGCATGAGGGTGCAGTACTTCTAGCTGTCAGAACATGTTATAATATTTATTTGGCATCTAAAAATCTTATAAATCAAACAACCGCCAGAGCTACTCTAACACAAATGTTAAATGTGATATTTACTAAAATGGAAAATCAAGCTTTAGAGTCGGAAGCTAGCAATTCTAACCTAGCACCAGAGACACAACACAAAATTCCTAATGGGAATATTTCAAGTGATGGGACATCCTGTGCAAAAAATGAGGATAATAAAGTAGAATCTAGTGAGAAGGAAGTTGATGAAGTTCTAGAAGCTAAATTAATAGCTAGACAAATAGTGGATTCTGTCATAGATAATGCCATCTCAATAGCAGCAAAGAAAACTGTTCAAGATGTGAGCCAGAATGGACCTGAAAACAATGAAAACCCACCTGACAGTCAGGACAATGTCAGTATTTCCCAAGAGAGCAATGGGCACCTTCACCCTGATACAACAATAGCAAGGATTCCATCACAAGAAAGTGTAGACGTTGCTTCAGAGAATGACACATCAGTTACTGCTAAATTCACTCATGTACTACAAAAAGATGCTTTCCTTGTATTCAGAGCTCTTTGTAAGCTCTCTATGAAGCCTTTACCAGATGGTACACCAGATCCAAAATCCCATGAGCTGAGGTCAAAGATTCTTTCTCTTCATCTGTTGCTGTCTATCCTTCAAAATGCTGGCCCTGTTTTTAGAAATAATGAAATGTTTATAACTGCTATTAAGCAATATTTATGTGTTGCCTTATCCAAAAATGGAGTCAGTTCTGTACCTGAGGTCTTTGAACTTTCACTGGCTATTTTCTTAGCCTTACTACAAAATTTTAAAGTTCATCTAAAGATGCAAATTGAAGTGTTCTTTAAAGAGATTTTCATGAATATTTTGGAAACTTCAAGCTCCTCCTTTGAACACAAATGGATGGTAATTCAAGCCCTCACTAGAATATGTGGTGATGCCCAAAGCGTTGTCGACATTTATGTTAACTATGACTGTGATCTATCAGCTGCCAACCTATTTCAAAGACTAGTTAATGATGTGTCCAAAATAGCACAGGGAAGACAAGCTTTGGAATTAGGTGCTACACCAAACCAAGAAAAGTCTATGAGAATTAGGGGCCTTGAATGCCTCGTGTCAATATTAAAATGTATGGTAGAATGGAGCAAAGAGTTATACATCAATCCCAATATGCAAACTACATTGGGTGAGAGACTGGTTAAAGAAGACACTGATCATCAAAGTATCAAATCTCATGGTGGATCGAGCCTTAGCTTGGTTTCAACTGGATCTAGTAACATTGGCAACCGAGAGACCTTGGATTCACCTGAACAGTTTGAAGTTTTGAAACAGCAAAAGGAAGTTTGGGAAACCGGTATTGACCTGTTTAACAGGAAACCAAAAAAAGGAGTTACATTCCTACAAGAACAAGCTTTACTAGGAACATCCACTAAGGAAATTGCTGAATGGTTACTAACGGACGAAAGACTTGATAAAACGTTCATTGGAGAATATTTAGGTGAAAATGATGATCATTCTAAAGAAGTTATGTATGCTTATGTTGATTCTATGAAGTTTTCTAACATGGACATCGTAGCGGCTCTGAGACATTTCCTGGAAGGTTTTAGACTACCTGGAGAAGCACAGAAAATTGACAGACTAATGGAAAAGTTTGCAGCTCGTTATTGTGAATGTAATCCAAATAATACACTTTTCATGAGTGCTGATACAGTTTATGTACTTGCATTTTCTATAATAATGTTAACAACAGATTTACACTCCCCACAAGTAAAGAATAAGATGACAAAAGAACAATACATTAAACTTAATAGTGGTATCAGCGACAATAACGACTTGCCGCGCGAATATCTGTCTCAGATATATGACGAAATAGCAGGACATGAAATAAAAATGAAAAACGTCTCGCGACCGGGCAAGCATATGATAGCGAATGAGAAGAAACGGAAATTCATATGGAATATGGAAATGGAACAAATATCAACGGCTGCTAAAAATTTAATGGAATCTGTATCCCATGTCCAAACGCCATTCACTACTGCGAAGCACGTTGAACACGTTCGACCGATGTTTAAGATGGCCTGGACTCCTTTCCTGGCTGCATTCTCTGTCGGTCTTCAAGATTGTGATGATCCCGAGATCGCGTCTTTGTGCCTGGATGGAATAAGATGTGCAATACGTATCGCGTGCATTTTCCATATGTCTTTAGAGAGAGATGCATACGTTCAGGCTTTAGCCAGGTTTACACTATTGACTGCAAATTCACCTATAACAGAAATGAAAGCAAAAAATATCGATACCATTAAAACCCTTATAACTGTCGCTCATACCGACGGAAATTACTTAGGATCAAGTTGGCTCGACGTCGTTAAATGTATTTCGCAACTGGAACTAGCTCAACTTATAGGCACAGGAGTTCGACCACAATTCTTGTCTGGGTCAGGCATTAAACCGCAACCAGATTCCCTGAAATTCAGCCTCATGTCCTTAGACCCTAGTGTTAAAGAACATATTGGAGAAACGAGTTCTCAAAGCGTGGTAGTCGCGGTGGACAGAATATTCACGGGATCTACGAGACTTGACGGTAACGCTATTGTTGATTTTGTCAAAGCTCTTTGCCAAGTGTCTCTTGACGAGCTAAGTCATCCTACGAATCCTCGAATGTTTTCCCTTCAAAAGATTGTGGAAATATCTTATTACAATATGGGTCGTATCCGTCTTCAATGGTCGCGAATTTGGCAAGTTTTAGGTGACCACTTTAATAAGATGGTCAACTCACAAGCACCAAACATTAAATCTGGATGGAAAAACATTTTTTCAGTATTTCATTTAGCTGCCAGCGATCAGGACGAAGCAATCGTTGACTTGGCGTTCCAAACTACTGGCAAAATCATTACGGAATTATATGAAAAACAATTTCCGGCTATGATAGACTCATTCCAAGACGCTGTTAAATGTCTATCCGAATTTGCCTGCAATGCTAAGTTCCCTGATACCTCCATGGAAGCGATAAGACTTGTCCGATCTTGTGCGACGGCAGTGGGAACGTCTCCACAGCTGTTCGCGGAGCACGCCGGCCTGGAAGGCGAACCTGGTGCTCCTGAGGTAGACAGGGTGTGGCTACGAGGATGGTTTCCACTATTATTTTCACTGTCGTGCGTCGTTAGCCGTTGCAAACTTGACGTTCGTACACGGGGACTGACGGTCCTCTTCGAGATTATAAAAACTCATGGCGATTCATTCCGTCCGCATTGGTGGAGAGATTTATTTAATATATTGTTTAGAATTTTCGACAATATGAAGTTACCTGAGCATCAATTAGAAAAGAATGAGTGGATGACTACGACGTGCAACCACGCTTTGTACGCTATTGTAGACGTCTTCACGCAATTTTTTGACATTCTAGGATCCCTTTTATTAGAACAGCTATATTCACAACTGCACTGGTGCGTGCAACAGGATAACGAACAACTCGCGAGATCTGGAACAAATTGCTTAGAAAATCTAGTCATATCAAACGGAACTAAATTTAACGAAGAAACGTGGAGCAAGACCTGTCAAATTATGTTGGATATATTTAATAGCACGCTACCCACTACACTGCTCACATGGAAACCTGACGAGAACGAAGACAGCGAACACCAGAACGTGCGCCACGGTATATTGAAGAAACCACAAGGTGGAGACGAAGTGAAGTCTTCGAACCGTGTGTTCAACAGTCTGTTAATCAAATGCGTAGTTCAGTTAGAGTTGATTCAGACAATCGACAATATAGTTTTCTACCCTGCGACGTCTCGAAAAGAAGACGCTGAAACCTTAGCCTTGGCTGCTGCCGAATTGACCGGAGGAACTCCTGGCACCGAACAAGAATGTCAACGTGAAGAACAAGGGATGTACAGACTTCTCAGCTCACCACATTTATTACGACTAGTCGAATGCCTGATGTGCAGCCATCGATTCGCTAAGACTTTCAATACAAATAATGCGCAAAGGAATGTGCTTTGGAAAGCTAACTTTAAGGGTTCCGTGAAACCGAATATGTTGAAACAAGAAACGCAATCCCTCGCATGCGTCCTTCGAATATTATTCAAGATGTATAGCGACGAAGCGCGGAGAAGTCACTGGCCTGCGGTCCAGAAAAGCCTCATCACGATATCCTGTGAGGCTCTAGAATATTTCGGTTCGCTCACCAACGAGGCGCACCGGGATGCTTGGACTTCTATTCTTCTCCTCATTCTCACACGAATACTTAAAATGCCAGACGAACGATTCGCTGCCCACGTGTCCAGCTACTACCCCATGTTATGCGAGATCACGTGTTTCGACCTGAAGCCGGAGCTCCGCTCGGTGCTGAGGCGGGTGTTCATTCGTATAGGACCCGTCTTCAACATCGTCAACGTCAACACGCAATAA

Protein sequence:

>DPOGS212337-PA
MQTNPKTKEMFIVRALEKILADKDIKRSYHSQLKKSCEVALEEIKTELKNGGQPETSESPTSGTLPLPKNDASNIITAEKYFLPFELACQSKAARIVVTALDCLQKLIAYGHLTGNIPDSTTPRKLLIDRIVETICSCFNGPQTDEGVQLQIIKALLTVITSQHVEVHEGAVLLAVRTCYNIYLASKNLINQTTARATLTQMLNVIFTKMENQALESEASNSNLAPETQHKIPNGNISSDGTSCAKNEDNKVESSEKEVDEVLEAKLIARQIVDSVIDNAISIAAKKTVQDVSQNGPENNENPPDSQDNVSISQESNGHLHPDTTIARIPSQESVDVASENDTSVTAKFTHVLQKDAFLVFRALCKLSMKPLPDGTPDPKSHELRSKILSLHLLLSILQNAGPVFRNNEMFITAIKQYLCVALSKNGVSSVPEVFELSLAIFLALLQNFKVHLKMQIEVFFKEIFMNILETSSSSFEHKWMVIQALTRICGDAQSVVDIYVNYDCDLSAANLFQRLVNDVSKIAQGRQALELGATPNQEKSMRIRGLECLVSILKCMVEWSKELYINPNMQTTLGERLVKEDTDHQSIKSHGGSSLSLVSTGSSNIGNRETLDSPEQFEVLKQQKEVWETGIDLFNRKPKKGVTFLQEQALLGTSTKEIAEWLLTDERLDKTFIGEYLGENDDHSKEVMYAYVDSMKFSNMDIVAALRHFLEGFRLPGEAQKIDRLMEKFAARYCECNPNNTLFMSADTVYVLAFSIIMLTTDLHSPQVKNKMTKEQYIKLNSGISDNNDLPREYLSQIYDEIAGHEIKMKNVSRPGKHMIANEKKRKFIWNMEMEQISTAAKNLMESVSHVQTPFTTAKHVEHVRPMFKMAWTPFLAAFSVGLQDCDDPEIASLCLDGIRCAIRIACIFHMSLERDAYVQALARFTLLTANSPITEMKAKNIDTIKTLITVAHTDGNYLGSSWLDVVKCISQLELAQLIGTGVRPQFLSGSGIKPQPDSLKFSLMSLDPSVKEHIGETSSQSVVVAVDRIFTGSTRLDGNAIVDFVKALCQVSLDELSHPTNPRMFSLQKIVEISYYNMGRIRLQWSRIWQVLGDHFNKMVNSQAPNIKSGWKNIFSVFHLAASDQDEAIVDLAFQTTGKIITELYEKQFPAMIDSFQDAVKCLSEFACNAKFPDTSMEAIRLVRSCATAVGTSPQLFAEHAGLEGEPGAPEVDRVWLRGWFPLLFSLSCVVSRCKLDVRTRGLTVLFEIIKTHGDSFRPHWWRDLFNILFRIFDNMKLPEHQLEKNEWMTTTCNHALYAIVDVFTQFFDILGSLLLEQLYSQLHWCVQQDNEQLARSGTNCLENLVISNGTKFNEETWSKTCQIMLDIFNSTLPTTLLTWKPDENEDSEHQNVRHGILKKPQGGDEVKSSNRVFNSLLIKCVVQLELIQTIDNIVFYPATSRKEDAETLALAAAELTGGTPGTEQECQREEQGMYRLLSSPHLLRLVECLMCSHRFAKTFNTNNAQRNVLWKANFKGSVKPNMLKQETQSLACVLRILFKMYSDEARRSHWPAVQKSLITISCEALEYFGSLTNEAHRDAWTSILLLILTRILKMPDERFAAHVSSYYPMLCEITCFDLKPELRSVLRRVFIRIGPVFNIVNVNTQ-