Monarch geneset OGS2.0

DPOGS200730
TranscriptDPOGS200730-TA1704 bp
ProteinDPOGS200730-PA567 aa
Genomic positionDPSCF300030 + 80644-83231
RNAseq coverage839x (Rank: top 15%)
Annotation
HeliconiusHMEL0089520.087.25% 
BombyxBGIBMGA001032-TA0.083.87% 
DrosophilaSH3PX1-PA4e-14846.50% 
EBI UniRef50UniRef50_E3XGD42e-15348.15%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3XGD4_ANODA
NCBI RefSeqXP_002084311.19e-15046.52%GD12900 [Drosophila simulans]
NCBI nr blastpgi|3123706736e-15348.15%hypothetical protein AND_23209 [Anopheles darlingi]
NCBI nr blastxgi|1571362042e-15447.99%sorting nexin [Aedes aegypti]
Group
Gene OntologyGO:00055153.5e-31protein binding
GO:00071543.5e-31cell communication
GO:00350913.5e-31phosphatidylinositol binding
KEGG pathway 
InterPro domain[329-565] IPR0194972.6e-69Sorting nexin protein, WASP-binding domain
[202-336] IPR0016833.5e-31Phox homologous domain
[14-72] IPR0014522e-16Src homology-3 domain
Orthology groupMCL11733 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200730-TA
ATGTACAACGCTAGCAAAGTTCAAGGGTCCAAAATGTCGACCCAAGTACAGGCTTTGTATGACTTTACTGGTGAACCAGGCACAACCGAGATGTCTATCACCTCTGGAGAAATACTGACACTTATAAATACAGATATAGGGGAAGGCTGGTGGGAGGGGCGAAATTCCAGAGGTGAAACTGGTTTATTTCCTGCTGCATATGTGAGAAAAGTAACACCAGATGAGTCTGCACCCACAAAAATGGCTCCTCCAGCACCAAGATACGATCAGGCTGCAGATGATTGGGGAGATCACCAATACAGTGCTGGCGACAATAATTATCAAAGGGCGGCTTCACATGACGAGGGCTGGGATGATGATTGGGAGGATGACACATATTCAGAAATTGGACCTGGCCCACAGCAATCAAAGCAGGCAGTGAATCAACCTCTTACACCATTACCGGGAATGCCAATTAGTGATCTCAACCATCAGATGGACGACAACTCATCTACCTTTGGTTCATCAGTTGGCACAGTGAGAAAAAATAAATTTGCACCATCATCTAAAGTCAGCGGTGAGAGTTACCTTTTAGGTACTTTAAATGTTGAAGTACCAGATGCTGATAAAATATACATAGAACAAGAAGGGGATGCATATATTTGGTCGCCCATACCACAACCATATAATGTAACTGTTGCATCACCAAAAAAGGAATCCAAATTTAAGGGTATTAAGAGCTTCATAGCATATCAATTGACTCCCTCCTTTAATAATATTCAAGTATCCAGAAGATATAAGCATTTTGACTGGCTTCATGAGAGATTGCAGGAGAAATTTACACTCATCCCAATCCCACCTTTACCTGACAAACAGATCTCTGGAAGATATGACGAACAATTGATTGAGAGAAGAAGAGTTCAGTTACAGGAGTTTGTGGATTGGATGTGTAAACATCCAGTACTATCCAGATCGGAGGTCTGGCAACATTTCCTAACTTGCACAGATGAGAAACGTTGGAAGGCTGGTAAAAGACAAGCGGAGAGAGATAATTTATTAGGACTTAACTACTGTATATCATTAGTTGTACCTGAAAAAGCTTTACTTCAATCACAAGTAGACCACATCACGGAACAATGCCACATTTTCATGAATAGCATGGATAGTTCTGTTAAATCTCTGACAAATATGTGTATAGCACAAACAAAACGATTCCAAGGGCCTTATAAGAGCGATTGTCAAAAAGTAGGAGAGGCTTTTTACAACTTAGGAAATGCACTAAGTTTAGATGAAGGCACAATAGTTTCTACTTCAAAACTAACTTCAGCTATCAAAATGGCTGGCGGGGCCTACATTGAAATAGGCAGAATGTATGAGGAACAACCAAAATATGATTTCGAACCACTCGGTGATAAATTTCATCTTTACAAAGGTATAGTTGGCTCATTTCCTGATGTATTAGCAAATCACAAAGCAGCTGTGCAGAAGAAAAAAGAGTGTGAGAGATTGAGAGCTGAAAATAAAATGGAAAGGGAACAATTAAATGAAGTGTTTAGAAGAAATAATGTCATATCATATGCCCTTCTTGCCGAAATAAACCACTTCAAGTCGGAGAGGACGGTCGATTTAAATGCAACAATGCAGAAATTTCTCAAGCAGCAAATAACATTTTATAAGAAGATAGTTGATAAATTGGAAACAACACTACAACAGTTCCAAGAATAG

Protein sequence:

>DPOGS200730-PA
MYNASKVQGSKMSTQVQALYDFTGEPGTTEMSITSGEILTLINTDIGEGWWEGRNSRGETGLFPAAYVRKVTPDESAPTKMAPPAPRYDQAADDWGDHQYSAGDNNYQRAASHDEGWDDDWEDDTYSEIGPGPQQSKQAVNQPLTPLPGMPISDLNHQMDDNSSTFGSSVGTVRKNKFAPSSKVSGESYLLGTLNVEVPDADKIYIEQEGDAYIWSPIPQPYNVTVASPKKESKFKGIKSFIAYQLTPSFNNIQVSRRYKHFDWLHERLQEKFTLIPIPPLPDKQISGRYDEQLIERRRVQLQEFVDWMCKHPVLSRSEVWQHFLTCTDEKRWKAGKRQAERDNLLGLNYCISLVVPEKALLQSQVDHITEQCHIFMNSMDSSVKSLTNMCIAQTKRFQGPYKSDCQKVGEAFYNLGNALSLDEGTIVSTSKLTSAIKMAGGAYIEIGRMYEEQPKYDFEPLGDKFHLYKGIVGSFPDVLANHKAAVQKKKECERLRAENKMEREQLNEVFRRNNVISYALLAEINHFKSERTVDLNATMQKFLKQQITFYKKIVDKLETTLQQFQE-