Monarch geneset OGS2.0

DPOGS211941
Transcript: DPOGS211941-TA  2868 bp
Protein: DPOGS211941-PA  955 aa
Genomic position: DPSCF300011 + 758044-771335
RNAseq coverage: 141x (Rank: top 55%)
Annotation
Heliconius: HMEL005164  0.0  74.46%
Bombyx: BGIBMGA000890-TA  0.0  77.88%
Drosophila: snz-PA  0.0  42.11%
EBI UniRef50: UniRef50_E2AR00  0.0  50.74%  Sorting nexin-25 n=7 Tax=Formicidae RepID=E2AR00_CAMFO
NCBI RefSeq: XP_972788.1  0.0  50.39%  PREDICTED: similar to CG1514 CG1514-PA [Tribolium castaneum]
NCBI nr blastp: gi|307199298  0.0  51.54%  Sorting nexin-25 [Harpegnathos saltator]
NCBI nr blastx: gi|307199298  0.0  51.33%  Sorting nexin-25 [Harpegnathos saltator]
Group
Gene Ontology: GO:0004871  4.3e-16  signal transducer activity
    GO:0005515  5.5e-11  protein binding
    GO:0007154  5.5e-11  cell communication
    GO:0035091  5.5e-11  phosphatidylinositol binding
KEGG pathway 
InterPro domain: [99-261]  IPR003114  1.6e-31  Phox-associated domain
    [373-503]  IPR016137  2e-20  Regulator of G protein signalling superfamily
    [824-929]  IPR013937  5.6e-20  Sorting nexin, C-terminal
    [384-499]  IPR000342  4.3e-16  Regulator of G protein signalling
    [97-264]  IPR013996  1.3e-11  PX-associated, sorting nexin 13
    [649-735]  IPR001683  5.5e-11  Phox homologous domain
Orthology group: MCL13046  Single-copy universal gene
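
The InterPro spans listed above can be turned into domain lengths and checked against the 955 aa protein length. The following is a minimal sketch, not part of the database record. It assumes the bracketed coordinates are 1-based and inclusive, which the record does not state, and the names `domain_length`, `INTERPRO_SPANS`, and `PROTEIN_LENGTH_AA` are hypothetical.

```python
import re

# Protein length as reported in this record (955 aa).
PROTEIN_LENGTH_AA = 955

# InterPro accessions and spans copied from the record.
INTERPRO_SPANS = {
    "IPR003114": "[99-261]",
    "IPR016137": "[373-503]",
    "IPR013937": "[824-929]",
    "IPR000342": "[384-499]",
    "IPR013996": "[97-264]",
    "IPR001683": "[649-735]",
}


def parse_span(span: str) -> tuple[int, int]:
    """Parse a span such as '[99-261]' into (start, end) integers."""
    match = re.fullmatch(r"\[(\d+)-(\d+)\]", span.strip())
    if match is None:
        raise ValueError(f"unrecognized span: {span!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError(f"end before start in span: {span!r}")
    return start, end


def domain_length(span: str) -> int:
    """Number of residues covered, ASSUMING 1-based inclusive coordinates."""
    start, end = parse_span(span)
    return end - start + 1


for accession, span in INTERPRO_SPANS.items():
    start, end = parse_span(span)
    inside = 1 <= start and end <= PROTEIN_LENGTH_AA
    print(accession, span, domain_length(span), "aa", "ok" if inside else "out of range")
```

Under the inclusive-coordinate assumption, every listed span falls inside the 955 aa protein.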
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211941-TA
ATGACTAAATTTCTTCTTTATTTTCTTGTCCCCATATGCATATCATCTGTATATTATTTTTCTTTTCTTTGGAATTTTATGTACTTATCCATTCTTAGTGTTTGTCTCATAACTTGTGTTTTTGTGCTTACACTTTGTGGTCATCTCTCTTTGTCGTCTCCCCATCAAACTTCCACGTTATTGCTCGATAATATAGAGAAAGAAGCTCGACAATTGGAAACTGCAATACGGGATGATCAAAATAATTGGGTGTATGTCACGAAAAAGCCTCATCTGCCGGTTATTTTCGGTAGAACAGTTGACAGTCAACTCCAACTGCTAATAGATTATTTCATAAGGGACTTTGTAACTCAATGGTTGAAGGAGCTCTCACACAAACCAGAACCAGTCATTGACAAATTCAAAGAGCACATATGGACGGCAGTGCAGAACTTGTATGACAGACTGTTGAAAGTTGACGCTGAGAAGCTATTAGCCAATGACATGGTCACAAAAATCACACAACATTTTGAACGGATAAGGATCGCTAGAAGCTGTGCATTGGAACTAAATCAACCTCCAGTATTTGCTCTCGCCCCTCACCTGATGTCGAGCGACATGGAATTACACTACTTGAGGCAGATCAGCGAGTTCCTCGTAATGTTTCTGATGCCCAGATGCTACTCGCTCTCCCCCGTCAGTTATCTCATTAGGGAAATATTAGTGTGTAAAATTCTTCAGCCAGCCATAAATCTAGTCACGGAGCCAGATTACATAAACCAGAAGATAATACAGTATCTGGAGGCGCAGAAGGAGGTCGACGCGATGCATGTCAGAACCCACGAGTACGCCAAGACGTTCGAGGACTACATACGGCTCATAAACAGCTGTAATAATGTAGACACACTGAAGAGATTGCGTTACGACATAGTGACCCAAATCATGCAGGCCACCACTCTACAGAACGTGAAGCGGGCTAAAGGCATCGACATAGACGTTATCGAGAAAGGAGGGAACCACAATATAAGCAGACAGCAAGTGAGCGACGCCAGGAAGCTGAAGAGATACATAGACCAGCTCACCATCGCCAAGGACGAGTGCGAAAAGGCTCTAAGGAGGTTGGGGTGGGACGGAGCATTCCCAGCAGTAGAATCTGATAGTAAGGCGATGCCTCTCCACAAGGTCATATCGAGCGTGACAGGTCGCAAGTATCTGTCCATGTTCCTGGAGACTCTCTGCTCTCAAGGGCTGGTCGGTTACTGGGCGGCCGTGGACGAGCTTCGACACAGTCCGAGGAGTAACTGGCATCAGCTCGGTGCTGAGATCTTCTATACATACATCAGATCGCCCAGCGCTGAAGTTAAAGTTGACAAGGAAACCAGAAAGAGAATGGAAGGGTTTCTTCTTGGCGATAAAGGTCCGGAAGTGTTTTACGAGGTCCAAGATATAGTGGTCGATACCATACAAGACAAATATTATCATTCGTTTCTCCTAAGCGACCAGTACAAAGCTTTGGTCGCGGAACTGGCCACCGAGGAGGCGAGCGATCCAGGTTTATGCTCTGAGAGGTCTCCGATAGACGAGCGGCAGGGTTCTCGCGAGTCGTCTTCCGAAGTGAACGCCTTGACGGAACATTCCACGTACGCCAGGCGGAAGCTGGACCAGCTGCAAGAGAGACACAACAACAAAACACAGGCGTTGGCCGCCTTGCGGGCGTCGCTGAAGCCGGAGTCTCCGGCGCTGGCGATGCTGGCGGAGGAGGTGGAGCGGTTGGCCGCGGAGCAGATGAGGCTGGAGGCGCACCTGGCCAGGACCGACACCTGGGCCGAGAACCTGGGCCTGTGGCGCGCCACCGTACATAGTGCTGAGATGGTGGAGGAGTCTCGTCCCCAGTTCGTGGTGGTGGTACACGCGCTGCAGCCCGAGGAGGAGCGCGGCCCGCGGCCCGAGCAGAGGGCGGCCGGGTGGGTGCTGCTCAGGAGCGCTCACGACTTCCAGGAGCTGCACAGGAAACTGAGACC
GATGTGTTCAGAATTAAAAAACTTAGAACTACCGTCGAATTCATTCAAATTCATGTTCGGGAAGAACGATAAGAACTCGCTCGAAAAAGCGAAAATGTTGATACAAAAATATTTAGAATTTGTTTTAGAAGACGACAGACTGAACCAAAGCGAAGCTCTGTACACCTTCCTGAACCCCAGCTCCGAGTATCTCAAGCAATGTGATCTGCCAAAGAAGAATAAGTTCTCATTCTCAACGCTATTTAAAAGCACGAGCAGCGACACGACCAACAGATCGTCCCAGGAGAAGGAGGGGCCGAGTCTGTCAGACGAGGACGAGATGTCCCTGTACCTGGACGGGAACGGGGAGGCGCTGAAACAGGGCGGCACCGTGAGAGGAGTGGGGCCGCTGGTGGAGGAGCGCGACAGTATCGCGGAGCCGCTGTACGCGCTGTTGAGCGAGGTGTTCGACATGAGGGGCGTGTTCCGCTGGCTGAGGAGGACCCTCGTCACCTTCGTTCAGATCACGTACGGCAGGACCATCAACAGACAGATCAAGGAGACGATCTCCTGGCTGTTCTCTGAGCAGATGCTGCACTACTACACCGGCCTGGTGCTGAAGTCCTGGTGGCCGGGGGGCGTCCTCACACACAGCAACACCAACAGGAACATACGGGACAAGGAGCACTCCCGCACGCTGGCGTTGCACCAGCTGACGGAGTTTGTCGTGGGCGGCGTGTCGTCGCTGGTGGGCGCGCACGCCGCCGCCCACGGGGCCAGCAAGCTGTTCCACACGCTGCAGCACACCACGCACAACAAACAGCTGTTCTACGAAATCTTCGAGTTGGTCCTCTTAGAAGTGTTCCCAGAACTGAAGCGTTATCAATGA

Protein sequence:

>DPOGS211941-PA
MTKFLLYFLVPICISSVYYFSFLWNFMYLSILSVCLITCVFVLTLCGHLSLSSPHQTSTLLLDNIEKEARQLETAIRDDQNNWVYVTKKPHLPVIFGRTVDSQLQLLIDYFIRDFVTQWLKELSHKPEPVIDKFKEHIWTAVQNLYDRLLKVDAEKLLANDMVTKITQHFERIRIARSCALELNQPPVFALAPHLMSSDMELHYLRQISEFLVMFLMPRCYSLSPVSYLIREILVCKILQPAINLVTEPDYINQKIIQYLEAQKEVDAMHVRTHEYAKTFEDYIRLINSCNNVDTLKRLRYDIVTQIMQATTLQNVKRAKGIDIDVIEKGGNHNISRQQVSDARKLKRYIDQLTIAKDECEKALRRLGWDGAFPAVESDSKAMPLHKVISSVTGRKYLSMFLETLCSQGLVGYWAAVDELRHSPRSNWHQLGAEIFYTYIRSPSAEVKVDKETRKRMEGFLLGDKGPEVFYEVQDIVVDTIQDKYYHSFLLSDQYKALVAELATEEASDPGLCSERSPIDERQGSRESSSEVNALTEHSTYARRKLDQLQERHNNKTQALAALRASLKPESPALAMLAEEVERLAAEQMRLEAHLARTDTWAENLGLWRATVHSAEMVEESRPQFVVVVHALQPEEERGPRPEQRAAGWVLLRSAHDFQELHRKLRPMCSELKNLELPSNSFKFMFGKNDKNSLEKAKMLIQKYLEFVLEDDRLNQSEALYTFLNPSSEYLKQCDLPKKNKFSFSTLFKSTSSDTTNRSSQEKEGPSLSDEDEMSLYLDGNGEALKQGGTVRGVGPLVEERDSIAEPLYALLSEVFDMRGVFRWLRRTLVTFVQITYGRTINRQIKETISWLFSEQMLHYYTGLVLKSWWPGGVLTHSNTNRNIRDKEHSRTLALHQLTEFVVGGVSSLVGAHAAAHGASKLFHTLQHTTHNKQLFYEIFELVLLEVFPELKRYQ-
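
The transcript and protein lengths reported in this record are consistent with each other: 955 residues plus one stop codon give 956 codons, or 2868 bp. The following is a minimal sketch of that check, assuming the transcript consists only of the coding sequence including its stop codon, which the lengths suggest but the record does not state. The function names and the short sequence excerpt are illustrative only.

```python
# Standard stop codons (DNA alphabet).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_cds_length(protein_length_aa: int, has_stop_codon: bool = True) -> int:
    """Length in bp of a coding sequence for a protein of the given length."""
    codons = protein_length_aa + (1 if has_stop_codon else 0)
    return 3 * codons


def ends_with_stop(nucleotides: str) -> bool:
    """True if the last three bases form a standard stop codon."""
    return nucleotides[-3:].upper() in STOP_CODONS


# Values taken from the record: 955 aa protein, 2868 bp transcript.
print(expected_cds_length(955) == 2868)

# Last bases of DPOGS211941-TA as listed above (excerpt, not the full sequence).
print(ends_with_stop("CGTTATCAATGA"))
```

The trailing `-` in the protein sequence corresponds to the terminal TGA stop codon of the nucleotide sequence.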