Monarch geneset OGS2.0

DPOGS214494
TranscriptDPOGS214494-TA4635 bp
ProteinDPOGS214494-PA1544 aa
Genomic positionDPSCF300122 + 276215-294922
RNAseq coverage228x (Rank: top 44%)
Annotation
HeliconiusHMEL0139301e-17373.25% 
BombyxBGIBMGA013394-TA0.077.61% 
DrosophilaCG8683-PB0.046.44% 
EBI UniRef50UniRef50_E2BVY30.052.78%Protein MON2-like protein n=10 Tax=Formicidae RepID=E2BVY3_HARSA
NCBI RefSeqXP_393240.30.051.45%PREDICTED: similar to MON2 homolog [Apis mellifera]
NCBI nr blastpgi|2420054330.048.81%guanine nucleotide-exchange, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420054330.048.64%guanine nucleotide-exchange, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00054881.8e-13binding
KEGG pathwaysbi:SORBI_02g0365105e-22 
 K13462 (MIN7)maps-> Plant-pathogen interaction
InterPro domain[780-1341] IPR0160241.8e-13Armadillo-type fold
[874-951] IPR0154035.2e-10Domain of unknown function DUF1981, SEC7 associated
Orthology groupMCL13346 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214494-TA
ATGGCTTTTGTCAGTGCTGTTACCGGCGATGACTCCACGAAGAAATTCATGGATGTCTTACAAAATGATTTCAAAACTCTCAGCTTAGAAACCAAGAAAAAATATCCTCAGATAAGAGAGGCTTGTGATGAAGCCATAGAAAAACTAGCTTTGGCATCAAATAATCCACAAGCCTCGCTGTATGGTGTTGTGAACCAAATCCTGTATCCATTAGTCCAAGGATGTGAATCAAAAGACGTCAAGATTATTAAGTTCTGTCTCGGAACTATTCAGCGATTGATAGCCCAACAGGGAATAGATGCAAAAGGTGCTCGGCACATAGTGGACTGTCTGTACAACCTCGGCCATTCGGGAATGTTGGAATTAAAGTTACTACAAACAGCCGCATTATTGATGACAACCTCCGATCTAGTTCACGGAGACACCTTGGCCAGGACTATGGTTCTCTGTATACGAATGGTGTCTACTACTGAGACTCGGGATATCAGTACAAGTCACGCGGCTGTGGCCACAGTACGACAGCTTGTAGCACTTGTCTTTGAAAGAGCTTTAGCGGAAGCTAACGGAACGTTAAAAGTAAATCCAGCGGATGTGAGAATACAAGCAAACAGCAAAGCGCCTAAAGAGTTGAAACCGTGTGCTGTTGATGCATATCTTATATTACAAGATATAATACAATTGATAAACGGTGACGCTGCTAACTGGCTTGTGGGAATATCAGATGTACCAAAGACCTTCGGATTAGAGCTCCTTGATACAGTATTGACAGATTTTTCTGATGTTTTCTTTAAGATTTCAGAATTCCGTTTCCTGTTGAAGGAGCATGTGTGCGCATTAATTATAAGATTGTTCTCGCCTAATGTTAAGTACCGGGCTGCCTTCCCGTCTCCCCACATCCCGGGCGGCGGCGCGGCCCCGGGGGCGGAGCGGCCTCATTTTCCGGTGACCATGCGACTGCTGCGGCTCGTGTCGGTCATCGTACATAAGTATCACGACGTACTGATGACCGAGTGCGAGATCTTCTTATCCCTGAGCATCAAGTTCCTTGATCCTGATAAGCCGCTATGGCAGAGGGCGCTGGCGCTGGAGGTTCTACATCGGATGACCGTACAGCCTGATCTACTGAAAGCGTTTTGCGAATGCTACGACATGAAGCCGCACGCGACCAATATATTTCAAGACATAGTGAACGCTCTCGGAGCGTATGTGCAGAGTCTGTTCGTGGCGTCCCAGGTCAATACTTCAGCCGGCTCATCGAGTATCCCTCAACAGGCTGGTTTCTACTGGAAGGGAGTCTGGTTACCGCTGTGTGTGACCTTCGAACCGGGCGTGGCGAAATCTGTATATATAGAGATGTTGGACAGGACGGAGGCGCCCAGCATCCAGGAAGGCTACGGCATCTCGGTCGCTTACGCCTGCCTCGTCGAGATAATACGCTCCATAGCTATCACCATCGAGGGAGAGGAGTACTTTAGACTTCAAGAACTCTACGACGATACACAGAACGACGACGAGAAAGAGGTTAATTCGAATAGGACGACTAATAATAATATGAAAGATAGTAATAGTAAAGCGGACACCAAGGAGTTAATAAACAACGGTCATTCTACCGCGGACCAAAACTCTAACGTGATGAAAATACAACCAGACGACGAGGACAGGGAGAGACAGCTGAAACTACAGCTGATCAAGTCATCGTGGTGTGGTCTAGTGTGGGGACTGTCGGTGCTAGCGGAGGCCAGCATCGGGGAGTTGGAGCACGTGCTGCGAGCCGTGCAGACCCTGGCTAGAGTTAGTGGGAAGATGGGCGTGACCAACGCGCGTGACGCATGCGTGGGCGCATTGTGTCGGTGTGCGTTGCCCGCGCAGTACTGTGTCCCTGTCCTGGGCGCTCTGGCCGCCCTGGCGTGTCCCTGGCCCGGGGCCCGACCCCCAGCCCCCGCCCCGGACCTAAGACACCACGTGGTGTGGGTCGGCACCCCGCTTCCGTGCTCGCAGCCGACAGGTCAGCAGCAGTCGTTTGTGATGGTGACGTCACGCCACGTGTCCGCCCTGAGAGCACTGTTGACGGCTGCCGCTCGGGACGGAGACGCCCTGCAGCACGCCTGGCTGCCGGTGCTGACCACGTTGCAGCATCTGGTGTGGATCCTGGGTCTGAAGCCGTCTACCGGCGGCAGTATGAAGGCGAGTCGGGCGAGCGCTGACGCCAACGCTGTCATGAGCACGTCCGCGGTCATGGCCGACCTGCCGGGTCAGCAAGTCCCCGTGGCGGAGTCGCTGGGTGTGATGAGCGCGTTGTCGGCCATGTTGTCACGTGTTTTCGAAGCGTCCAAGAACTTGGATGACGTGGCCCTTCATCATCTGATCGACGCGTTATGCAAGCTGTCCAACGAGGCGATGGAGTTGGCTTATTCTAATAGGGAGCCGTCCCTCTTCGCTGTGGCCAAATTGTTGGAGACGGGTCTAGCCAACATGCACCGCATAGAGGTCATGTGGAGACCCATCACGAATCATCTCCTGGAGGTCTGCCAGCACCCTCACATCAGGATGCGGGAGTGGGGGGTGGAGGCCATCACCTACCTGGTGCAAGCGGCCTTCCAATACCATCACAATCATCCTGAACTCGTCACTGAGGCCCGTGAGCGTCTGGTGCTAGAACCTCTGGGAGAGCTGTGCTCCGTTCGTCACTGTGACGTAAGAGCTAGACAGCTGGAGTGTGCTGCGAGACTGCTCCACTCCAGAGGCGACCAGCTGGGAGCCGCCTGGCCGCTCATGATGGAGATCATATCGGCTATCGGCGACCATCATAGTGAGCAACTGGTGCGCTCAGCGTTCCAATGCGCCCAGCTGGTGGCGGGTGACCTCCTGGGATGTGCAGGTCCCAGGTGTCTCCGACGAGTGCTGGCCGCTGCGGCCGCCTTCGCCAGACAGACCAAAGAATTGAATATCAGTCTCACAGCTGTAGGACTGATGTGGAACATCTCGGACTACTTGTACCACAACCGCGACAAGCTGTCAGCGGCGCTGGTCAACGAGTCGGTGCCGGATGTCCAACCCGACCTTCCGCCTCTGGATCGACTGTGGATGTGTCTCTACATACGACTCAGTGAGCTGTGCACGGAGGCCCGGGCCCCGGTCCGTCGCGCCGCCAGCCAGACCCTGTTCAGCTGTATCGGTGCTCACGGGTCCCTGCTGGGCCGGCCCGCCTGGCGATCACTCCTGGCCGTGCTGTTCCCCATGTTGGACCAGGTCCGGAGGCACTCGGACGTGGCCAGTTCGGAAAAGGTGGACACGGGGGAAGTGTTGACGTTGTCCGGGGTGTCCCGCGTGTTCCACTCCAGGTTCCAGCTGTTAATGACTGTTGGTGACTTTATCCGCTCGTGGGTCGCTCTACTAGACTACATCACAGATTTCGCGCTCAGACGAAGTCACGAGGTGTCGGTGGCTGCTCTCAAGTCGTTCCAGGAGGTGGTGTCGGCTGCAGGTCGAGCGGAGGGCGAGGTCCCGCGCCGCGTGTGGTCGGCCGCCTGGAACGCCTGGACGGCCATCGCCACGGGGCTCGCGACTCCGCCTGGGTGTGTGGACGACAAGCCCGCGGAGCTGTTCTCACCGTCGCTGAACTTCCTCACCACCCTGTCACAACGATATCCTACACGTCCCCAGGAAGCGTTGGTCCGTCACGAGTTACTGCCAGCTATGTTTGGTGCCCTGACGTGTCTCGCGGCCGCCGCCTCCGAGCAGCCCTCCGCGGCCGTGCGCTGCCTGGCCGCGGCCGCCAACCTGTACCGAGCGGCGCCCGCTCCCAGCGCGCACCAACTGCCTACACTCATGAAGGCGCTACACTCGGCTGTCCGCCTGTGTCCGGAACGACGTCGCAACGAGCGAGACAGAGAGGGAGGGGACGAGCCCGCGCACACCACCGCCCTCTTGTTACAGGTGCTAGCGACGGGGCTGCCTCTAGCGCGAGAGCATCCAGACGATTACAGCGAGTTCTGGGAGATGCTGCCCGAAGTACTGGAGACATTCATGTTCGAACCGCCAGTGGGTGGTAGCGCTCAGGCTTGTGAGGTGGTGGGTGTGATCCGAGATGAAGTGCTGCGAGGGATCCCGCGACCTCCTCAGCGACCGGCGACAAGACTGCTGGCGCTCGTGAGGGCTGGTTCCATGCATCACACCAGACCTCACACTGTACTCACTAGAGATCAGAACGAGCAAGAGTTGAAGGAGAGAGAGGAGTTTGCTAGAACATGTTTCGAGACTCTGCTACAGTTCTCTATGCTGGAGGACATGGACACACTCACCACCGCTGAAAACGACAGCGATCCCCTGGCGATAATGCCTCTACTGGACCGCTTCCAGGAAGTTATAGCGAAGTACAGTAGAGACGAGGAGAGTACGGAACCTATACCCAGACAACAAGTGTCCGAGGTGTCGTTCGTCCTGCGGGCGGTGGCGTCTCTAGCCGGTGCTATGTTGAGGGCGCCGCGAGGGAGAGTCGACGAGGCGGCCTGGGAGAAACTTATCGGCGTGTATCCGTCACTCGTCCGTCTGTCAGGCGGGGCGCGGGCGGGGGCGGCGGGGGCAGCCCTGAGGGAAGCCCTGATGCAGTTCGGAGCGCTACTGGCGCCGCCCTAG

Protein sequence:

>DPOGS214494-PA
MAFVSAVTGDDSTKKFMDVLQNDFKTLSLETKKKYPQIREACDEAIEKLALASNNPQASLYGVVNQILYPLVQGCESKDVKIIKFCLGTIQRLIAQQGIDAKGARHIVDCLYNLGHSGMLELKLLQTAALLMTTSDLVHGDTLARTMVLCIRMVSTTETRDISTSHAAVATVRQLVALVFERALAEANGTLKVNPADVRIQANSKAPKELKPCAVDAYLILQDIIQLINGDAANWLVGISDVPKTFGLELLDTVLTDFSDVFFKISEFRFLLKEHVCALIIRLFSPNVKYRAAFPSPHIPGGGAAPGAERPHFPVTMRLLRLVSVIVHKYHDVLMTECEIFLSLSIKFLDPDKPLWQRALALEVLHRMTVQPDLLKAFCECYDMKPHATNIFQDIVNALGAYVQSLFVASQVNTSAGSSSIPQQAGFYWKGVWLPLCVTFEPGVAKSVYIEMLDRTEAPSIQEGYGISVAYACLVEIIRSIAITIEGEEYFRLQELYDDTQNDDEKEVNSNRTTNNNMKDSNSKADTKELINNGHSTADQNSNVMKIQPDDEDRERQLKLQLIKSSWCGLVWGLSVLAEASIGELEHVLRAVQTLARVSGKMGVTNARDACVGALCRCALPAQYCVPVLGALAALACPWPGARPPAPAPDLRHHVVWVGTPLPCSQPTGQQQSFVMVTSRHVSALRALLTAAARDGDALQHAWLPVLTTLQHLVWILGLKPSTGGSMKASRASADANAVMSTSAVMADLPGQQVPVAESLGVMSALSAMLSRVFEASKNLDDVALHHLIDALCKLSNEAMELAYSNREPSLFAVAKLLETGLANMHRIEVMWRPITNHLLEVCQHPHIRMREWGVEAITYLVQAAFQYHHNHPELVTEARERLVLEPLGELCSVRHCDVRARQLECAARLLHSRGDQLGAAWPLMMEIISAIGDHHSEQLVRSAFQCAQLVAGDLLGCAGPRCLRRVLAAAAAFARQTKELNISLTAVGLMWNISDYLYHNRDKLSAALVNESVPDVQPDLPPLDRLWMCLYIRLSELCTEARAPVRRAASQTLFSCIGAHGSLLGRPAWRSLLAVLFPMLDQVRRHSDVASSEKVDTGEVLTLSGVSRVFHSRFQLLMTVGDFIRSWVALLDYITDFALRRSHEVSVAALKSFQEVVSAAGRAEGEVPRRVWSAAWNAWTAIATGLATPPGCVDDKPAELFSPSLNFLTTLSQRYPTRPQEALVRHELLPAMFGALTCLAAAASEQPSAAVRCLAAAANLYRAAPAPSAHQLPTLMKALHSAVRLCPERRRNERDREGGDEPAHTTALLLQVLATGLPLAREHPDDYSEFWEMLPEVLETFMFEPPVGGSAQACEVVGVIRDEVLRGIPRPPQRPATRLLALVRAGSMHHTRPHTVLTRDQNEQELKEREEFARTCFETLLQFSMLEDMDTLTTAENDSDPLAIMPLLDRFQEVIAKYSRDEESTEPIPRQQVSEVSFVLRAVASLAGAMLRAPRGRVDEAAWEKLIGVYPSLVRLSGGARAGAAGAALREALMQFGALLAPP-