Monarch geneset OGS2.0

DPOGS207002
Transcript        DPOGS207002-TA  2718 bp
Protein           DPOGS207002-PA  905 aa
Genomic position  DPSCF300001 + 1017227-1023866
RNAseq coverage   921x (Rank: top 14%)
Annotation
Heliconius        HMEL008671        0.0  90.88%
Bombyx            BGIBMGA012922-TA  0.0  85.09%
Drosophila        Bap-PA            0.0  81.31%
EBI UniRef50      UniRef50_Q10567   0.0  71.89%  AP-1 complex subunit beta-1 n=250 Tax=Eukaryota RepID=AP1B1_HUMAN
NCBI RefSeq       XP_002057979.1    0.0  81.08%  GJ15746 [Drosophila virilis]
NCBI nr blastp    gi|195398741      0.0  81.08%  GJ15746 [Drosophila virilis]
NCBI nr blastx    gi|194893157      0.0  82.17%  GG19251 [Drosophila erecta]
Group
Gene Ontology     GO:0008565  0         protein transporter activity
                  GO:0015031  0         protein transport
                  GO:0005488  1.9e-205  binding
                  GO:0006886  7.8e-165  intracellular protein transport
                  GO:0030117  7.8e-165  membrane coat
                  GO:0016192  7.8e-165  vesicle-mediated transport
                  GO:0030131  2.9e-32   clathrin adaptor complex
KEGG pathway      dvi:Dvir_GJ15746  0.0
                  K12392 (AP1B1)  maps-> Lysosome
InterPro domain   [1-759]    IPR016342  0         Adaptor protein complex, beta subunit
                  [3-563]    IPR011989  1.9e-205  Armadillo-like helical
                  [17-533]   IPR002553  7.8e-165  Clathrin/coatomer adaptor, adaptin-like, N-terminal
                  [4-583]    IPR016024  1.9e-150  Armadillo-type fold
                  [791-904]  IPR012295  1.3e-48   Beta2-adaptin/TATA-box binding, C-terminal
                  [793-905]  IPR009028  3.8e-47   Clathrin/coatomer adaptor, adaptin-like, appendage, C-terminal subdomain
                  [681-789]  IPR013037  2.9e-32   Clathrin adaptor, beta-adaptin, appendage, Ig-like subdomain
                  [680-792]  IPR013041  6.1e-31   Clathrin/coatomer adaptor, adaptin-like, appendage, Ig-like subdomain
                  [794-904]  IPR015151  7.8e-29   Clathrin adaptor, beta-adaptin, appendage, C-terminal subdomain
                  [682-785]  IPR008152  1.2e-08   Clathrin adaptor, alpha/beta/gamma-adaptin, appendage, Ig-like subdomain
Orthology group   MCL11325  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207002-TA
ATGACTGATTCAAAATACTTCACAACTACAAAGAAAGGCGAAATATTCGAGCTCAAGTCGGAGCTGAATAGCGATAAGAAAGAAAAGAAGAAGGAAGCTGTTAAAAAGGTGATAGCTTCCATGACCGTGGGCAAAGATGTGTCAGCTCTATTCCCTGATGTTGTTAATTGCATGCAAACCGACAACTTGGAACTCAAGAAGCTGGTATACTTGTACCTCATGAACTACGCCAAGTCCCAGCCGGATATGGCTATCATGGCTGTTAACACATTTGTCAAGGATTGCGAGGACTCGAATCCTTTGATCCGTGCTCTGGCTGTACGGACCATGGGTTGCATCCGAGTGGACAAGATCACGGAGTACCTCTGTGAGCCCTTACGGAAGTGTCTCAAGGATGAAGACCCCTATGTCAGGAAGACGGCTGCAGTGTGCGTCGCCAAACTGTACGACATATCACCCAGTATGGTGGAAGATCAGGGTTTCCTCGATCAGTTAAAGGATCTGTTGAGTGACTCAAATCCTATGGTTGTAGCGAACGCTGTCGCAGCTCTCTCTGAGATTAATGAGGCCAGTGTATCCGGACATCCGCTTGTAGAAATGAACGCCCCAACCATTAACAAGCTGCTGACAGCCCTGAACGAGTGCACGGAGTGGGGTCAGGTGTTCATACTGGACGCTCTTTCGAACTACTCCCCTCGTGATTCACGCGAGGCTCACTCTATATGCGAACGTATAACACCGCGTCTAGCCCACGCCAACGCCGCCGTCGTTCTGTCAGCCGTCAAGGTCCTCATGAAGCTCATGGAGATGTTATCAGATGAGACGGAATTAGTGAGCACTTTGTCCAGGAAGCTGGCTCCGCCCCTTGTGACGTTACTGAGTGCTGAGCCCGAAGTCCAATACGTAGCGCTACGGAACATAAACCTTGTGGTGCAGAAGAGACCGGATATATTAAAACACGAGATGAAGGTATTCTTCGTGAAGTACAACGATCCAATCTACGTGAAACTGGAAAAGCTGGATATCATGATACGTCTGGCGTCTCAAGCGAACATCGCGCAGGTCCTGGGCGAGCTGAAGGAGTACGCCACGGAGGTGGATGTGGATTTCGTGAGGAAGGCCGTCAGGGCCATAGGACGGTGTGCCATTAAGGTGGAGCCTTCAGCGGAGCGTTGCGTGTCAACCTTACTGGAATTAATTCAAACCAAAGTGAACTACGTGGTACAAGAGGCGATCGTTGTCATAAAGGACATATTCCGTAAATATCCCAACAAATACGAGAGCATTATAAGTACGTTGTGTGAAAATTTGGACACACTCGACGAGCCTGAGGCGAGAGCATCAATGGTTTGGATCGTGGGCGAGTACGCTGAGAGGATTGACAACGCCGACGAGCTCCTCGACTCTTTCCTCGAAGGTTTTCACGATGAGAACGCCCAGGTCCAGTTGCAGCTGTTAACAGCTGTTGTGAAGCTGTTCCTGAAGCGTCCAGCTGACACCCAGGAGCTCGTGCAGCATGTATTGAGTCTAGCCACACAGGACTCGGACAATCCAGATCTCAGGGACCGCGGGTTCATCTACTGGCGTCTGCTGTCAACGGATCCGGCTGCGGCTAAGGAGGTGGTCTTGGCTGATAAGCCTCTGATCTCTGAAGAAACGGACCTCTTGGAGCCGACCTTGCTAGACGAACTCATCTGCCACATAAGCTCGCTGGCGTCTGTGTACCACAAACCACCTACAGCATTCGTCGAAGGTCGCGGCGCTCCACAGGCGTTCTCAGATGGTCGAGCGCCACACACCGCGGACGAGGCGCCCGCGTCTGCGCCCGCTGTTATACCAAACCAGGAGTCCCTGATCGGTGATCTACTGTCTATGGATATCGGGGCCCCGCCTGCAGCGGCGACCGCGCCAGCACTCGACTTACTGGCTGGAGGGCTGGACGTACTTCTGGGCGGTCCTGCCGACAGCCAGCCGACAGCCAGCGTCTCTGGTAGCGCGAC
CGGTTTACTCGGAGACATCTTCGGAGCGACAGCACCCGCCTCATACGTACCACCCAAGCAATGTTGGCTGCCGGCTGATAAGGGAAAAGGTCTGGAGATTTGGGGTACATTCAGTCGCCAGAACGGTCAGCTTCGCATGGAGATGACGTTCACTAACAAAGCCATGCAGGCTATGAGCGGGTTCGCCATACAACTCAATAAGAATAGTTTCGGCGTGTACCCTGGAGGCGCGCTGTCTGTTGGAGCGCTCGGGGCGGAGGGGCGGGGGCGCGAGCTGACGCGCCTCTACCTCTGGCTACAGGCGGACCCGTTCAACGTGGCTGTCAAAAACAATATAGACGTATTTTACTTCGCGTGTCTCATTCCCGTTCACATCCTGTTCACTGAGGACGGACAGCTGGACAAGCGAGTGTTCCTCACCACCTGGAAGGAAATCCCAGCTGCGAACGAGTTCCAGCACACCATAACGAACGTCGTTGGCACCGCCGATTCGATCGCACAGAAAATGACCCTCAACAATGTTTTCACCATCGCTAAGAGGAACGTCGAGGGTCAAGACATGTTGTACCAGTCCCTCAAACTGACAAACAATATATGGGTCCTTCTAGAACTGAAGCTGCAACCCGGCAACCCAGAGGCCACGCTGAGCCTCAAGTCCCGCACCGTAGAGGTCGCTAATTGCATTTTCCAAGCTTACGAAGCCATCATTAAATCGTAA

Protein sequence:

>DPOGS207002-PA
MTDSKYFTTTKKGEIFELKSELNSDKKEKKKEAVKKVIASMTVGKDVSALFPDVVNCMQTDNLELKKLVYLYLMNYAKSQPDMAIMAVNTFVKDCEDSNPLIRALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLYDISPSMVEDQGFLDQLKDLLSDSNPMVVANAVAALSEINEASVSGHPLVEMNAPTINKLLTALNECTEWGQVFILDALSNYSPRDSREAHSICERITPRLAHANAAVVLSAVKVLMKLMEMLSDETELVSTLSRKLAPPLVTLLSAEPEVQYVALRNINLVVQKRPDILKHEMKVFFVKYNDPIYVKLEKLDIMIRLASQANIAQVLGELKEYATEVDVDFVRKAVRAIGRCAIKVEPSAERCVSTLLELIQTKVNYVVQEAIVVIKDIFRKYPNKYESIISTLCENLDTLDEPEARASMVWIVGEYAERIDNADELLDSFLEGFHDENAQVQLQLLTAVVKLFLKRPADTQELVQHVLSLATQDSDNPDLRDRGFIYWRLLSTDPAAAKEVVLADKPLISEETDLLEPTLLDELICHISSLASVYHKPPTAFVEGRGAPQAFSDGRAPHTADEAPASAPAVIPNQESLIGDLLSMDIGAPPAAATAPALDLLAGGLDVLLGGPADSQPTASVSGSATGLLGDIFGATAPASYVPPKQCWLPADKGKGLEIWGTFSRQNGQLRMEMTFTNKAMQAMSGFAIQLNKNSFGVYPGGALSVGALGAEGRGRELTRLYLWLQADPFNVAVKNNIDVFYFACLIPVHILFTEDGQLDKRVFLTTWKEIPAANEFQHTITNVVGTADSIAQKMTLNNVFTIAKRNVEGQDMLYQSLKLTNNIWVLLELKLQPGNPEATLSLKSRTVEVANCIFQAYEAIIKS-
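
Length consistency check:

The lengths reported for DPOGS207002 can be cross-checked against one another. The sketch below is illustrative only and is not part of the record. It assumes the transcript is coding sequence only (it begins with ATG and ends with TAA), that the trailing "-" in the protein sequence marks the stop codon, and that the genomic coordinates are 1-based and inclusive. The function and variable names are hypothetical.

```python
def codon_count(transcript_bp: int) -> int:
    """Return the number of complete codons in a coding sequence.

    Raises ValueError if the length is not a multiple of three.
    """
    if transcript_bp % 3 != 0:
        raise ValueError(f"{transcript_bp} bp is not a whole number of codons")
    return transcript_bp // 3


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
TRANSCRIPT_BP = 2718
PROTEIN_AA = 905
GENOMIC_START, GENOMIC_END = 1017227, 1023866

codons = codon_count(TRANSCRIPT_BP)

# One extra codon beyond the residue count is what a terminal stop codon
# would account for (assumption: the transcript has no UTRs).
has_stop = codons == PROTEIN_AA + 1

genomic_bp = span_length(GENOMIC_START, GENOMIC_END)

# Genomic bases not represented in the transcript; under the assumptions
# above these would be intronic.
non_transcript_bp = genomic_bp - TRANSCRIPT_BP

print(codons, has_stop, genomic_bp, non_transcript_bp)
```

Under these assumptions the 2718 bp transcript corresponds to 906 codons, that is 905 residues plus a stop, and the genomic span is larger than the transcript, which is what one would expect for a gene with introns.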