Monarch geneset OGS2.0

DPOGS203367
TranscriptDPOGS203367-TA2844 bp
ProteinDPOGS203367-PA947 aa
Genomic positionDPSCF300003 + 176711-188605
RNAseq coverage1158x (Rank: top 11%)
Annotation
HeliconiusHMEL0135300.077.38% 
BombyxBGIBMGA003892-TA0.078.83% 
Drosophilabeta'Cop-PA0.068.97% 
EBI UniRef50UniRef50_O626210.068.97%Coatomer subunit beta' n=25 Tax=Opisthokonta RepID=COPB2_DROME
NCBI RefSeqNP_001166610.10.078.75%coatomer protein complex subunit beta 2 [Bombyx mori]
NCBI nr blastpgi|2905608910.078.75%coatomer protein complex subunit beta 2 [Bombyx mori]
NCBI nr blastxgi|2905608910.078.75%coatomer protein complex subunit beta 2 [Bombyx mori]
Group
Gene OntologyGO:00068863.9e-155intracellular protein transport
GO:00301173.9e-155membrane coat
GO:00051983.9e-155structural molecule activity
GO:00161923.9e-155vesicle-mediated transport
GO:00055152.5e-72protein binding
KEGG pathwaypcs:Pc20g039702e-77 
 K05236 (COPA)maps-> Neuroactive ligand-receptor interaction
InterPro domain[1-909] IPR0164530Coatomer, beta' subunit
[320-779] IPR0066923.9e-155Coatomer, WD associated region
[335-375] IPR0159432.5e-72WD40/YVTN repeat-like-containing domain
[1-297] IPR0110464.1e-70WD40 repeat-like-containing domain
[198-487] IPR0110483.9e-16Cytochrome cd1-nitrite reductase-like, C-terminal haem d1
[220-257] IPR0197812.2e-11WD40 repeat, subgroup
[218-257] IPR0016802.5e-10WD40 repeat
[158-172] IPR0204721e-05G-protein beta WD-40 repeat
Orthology groupMCL13157 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203367-TA
ATGCCGTTAAGATTGGAAATCAAGAGGAAGCTGACAGCGCGATCTGATCGCGTCAAGTGCGTCGACCAGCACCCAACAGAACCGTGGCTGCTTTGTTCGCTGTACAGCGGCGACGTCAATATATGGAACTATGAAACACATACACAGATCAAAAGATTCGAGGTGTGCGACTTACCAGTGAGGGCTGCTAAATTCGTGATGCGGAAGAACTGGGTCGTGACAGGATCTGACGATATGCAGATCAGAGTGTTTAATTACAATACTCTAGAAAGGGTGCACAACTTTGAGGCTCACTCGGACTACATAAGATGCATCGTCATACATCCCACACAGCCTTACATACTGACAAGCAGCGACGATCTCCTCATCAAGCTGTGGAACTGGGACCGTAACTGGGCTTGCCAGCAAGTGTTCGAGGGCCACACCCATTACGTGATGCAAATTGTTATCAACCCTAAAGATAACAATACATTTGCCAGCGCAAGTCTCGATACCACTGTTAAGGTTTGGCAGCTCGGTTCATCAATCTCAAACTTCACATTGGAAGGTCACGAGAAAGGTGTCAACTGTGTGGATTACTACCACGGTGGTGAAAAGCCCTACCTCATCAGCGGTGCCGATGACCGTCTTGTCAAGATATGGGATTACCAGAACAAGACATGTGTTCAGACGCTAGAAAGTCACGCCCAGAATGTCACAGCGGTTTCCTTCCACCCGGAACTTCCCATCCTGCTTACGGGCTCCGAGGACGGCACCGTGAGGATCTGGCACGCCGGGACATACAGACTGGAAGCGGCCCTCAACTACGGCTTTGAAAGAGTATGGACTTTATCATCACTCCACAGATCCAACAATGTGGCTATCGGATATGACGAAGGTACCATAATGATCAAAGTTGGAAGAGAAGAGCCCGCTATATCCATGGATGTGAACGGTGGGAAAATAATTTGGGCCAAGCATTCTGATATGCAGCAAGTTAATTTGAAAGCTCTACCCGAAGGTACAGATATAAAAGATGGCGAACGGGTCCCAGTGGTTGCTAAAGATATGGGTTCCTGTGAGATATATCCCCAGACGATAGCCCACAATCCAAACGGACGTTTCGTGGTTGTGTGCGGTGATGGGGAATACATAATATACACAGCCATGGCCTTGAGGAATAAGGCCTTCGGAACAGCCCAGGAGTTTGTGTGGGCTTTGGATAGCTCGGAGTACGCTACACTGGAGAATTCTAGCACAGTGAAAGTCTTTAAGAACTTCAAGGAGAGGAAGAGCTTTAAACCTGAATATGGCGCTGAAGGTATCTTCGGTGGATTCATGCTGGGCGTTAAGTCTATCAGTGGCATGGCCTTCTCGTTCTACGACTGGGAACAATTGGAGCTCATTAGACGTATCGAGATTCAGCCTCGTCATGTTTTCTGGTCTGAGAGCGGAAGCCTAGTGTGTCTGGCCAGCGAGGAGGCCTACTACGTGCTGAAGTACAACGCTTCTGTCGTAGCTAAATCAAGAGAAAATAATACTAACGTAACCGAGGACGGCATCGAGGATGCTTTCGAGGTTGTGGGCGAAGTAAATGAGTCGGTGAAGACGGGCTTGTGGGTAGGCGACTGCTTCATATACACCAACTCGTTGAACAGAATCAACTATTACGTCGGCGGTGAGATTGTGACCATAGCGCACTTGGACCACACGATGTATATCCTGGGATACGTCGCTAAAGAAAACAGGCTGTACCTCAACGACAAGGAGTTGAACATAGTGTCGTATTCCCTCCTGCTGCCGGTTCTGGAGTATCAGACGGCGGTGATGAGAGGTGACTTCGAAACAGCTGATCGCGTCCTGCCGACCATACCTCACGATCATCGCACCAGGGTCGCACATTTTCTCGAGAAACAGGGCTTCAAACAACAAGCTCTGGCTGTGTCAACGGAGCCCGAACACCAGTTCGAGCTGGCCCTGTCGCTGGGCGAGCTGAAGAAGGCCAGCCAGTTGGCAGAGGAGTCAGATAAGGCCGAGGGCCGCGAGGACAACCAGCCCTCGAGGCCTTCAGCTGCCAGGTGGTCCAGATTGGGAGCAGCAGCTGCAGCAGCTGCAGACACTGATCTCACCAAGTTCTGCTACCAGAAGGCCCGCGACTACAGCGCCCTGCTACTATTCTCCGTCAGCACTGGCGATCGTGAGTTGCTGGAAGAGGTGGCTCATATGTCCGATCTGGCCGGTGAAGATAACATAGCCTTCACATCCTATCTTACTCTGAATGACCTGGACTCTTGTCTGGCGCTGCTTCTCAAACGAAACAAACTACCAGAGGCTGCGTTCTTCTGCAGGTCATACTATCCTTCAATGATGAGCGATGTCCTCAAACGTTGGAGGGATTCCGTCTCTATGACCAATCCCAAGTGCGGCCAGGCCTTGGCCGATCCCAACAAATACGACAACCTGTTCCCGGAATACATGGATACCCTGGCGATGGAGTTCTACCAGAAGCACTTTGGTTATCCGTACTACAATCAGTTGGAGCATATCAAAGAGAACACTGATTTATGCAATGTTGACCGAGACATGGCTCACGAAAGGCTGGTCGCTATCCACATGGGCGCCTGGGACCCTAGGGTCATAACCCCACCATCCGGTGCTTCAGGTCTCTCCAGTCTACAGGACAGTCCGAGACGAGATCCCAGAAATCCAGATAGTTCAGATGAAGCTTCCTATTCTGATGAAAAGATCAGACGTAGAGACTCCATGGACATCCTCGAAGAGATTGAACGTGAGATAGACAACATTGTGCTGGACAACAACGAAGAGGATCTGGATTCGTCAGACGAGACCATGTATCTTGAATAA

Protein sequence:

>DPOGS203367-PA
MPLRLEIKRKLTARSDRVKCVDQHPTEPWLLCSLYSGDVNIWNYETHTQIKRFEVCDLPVRAAKFVMRKNWVVTGSDDMQIRVFNYNTLERVHNFEAHSDYIRCIVIHPTQPYILTSSDDLLIKLWNWDRNWACQQVFEGHTHYVMQIVINPKDNNTFASASLDTTVKVWQLGSSISNFTLEGHEKGVNCVDYYHGGEKPYLISGADDRLVKIWDYQNKTCVQTLESHAQNVTAVSFHPELPILLTGSEDGTVRIWHAGTYRLEAALNYGFERVWTLSSLHRSNNVAIGYDEGTIMIKVGREEPAISMDVNGGKIIWAKHSDMQQVNLKALPEGTDIKDGERVPVVAKDMGSCEIYPQTIAHNPNGRFVVVCGDGEYIIYTAMALRNKAFGTAQEFVWALDSSEYATLENSSTVKVFKNFKERKSFKPEYGAEGIFGGFMLGVKSISGMAFSFYDWEQLELIRRIEIQPRHVFWSESGSLVCLASEEAYYVLKYNASVVAKSRENNTNVTEDGIEDAFEVVGEVNESVKTGLWVGDCFIYTNSLNRINYYVGGEIVTIAHLDHTMYILGYVAKENRLYLNDKELNIVSYSLLLPVLEYQTAVMRGDFETADRVLPTIPHDHRTRVAHFLEKQGFKQQALAVSTEPEHQFELALSLGELKKASQLAEESDKAEGREDNQPSRPSAARWSRLGAAAAAAADTDLTKFCYQKARDYSALLLFSVSTGDRELLEEVAHMSDLAGEDNIAFTSYLTLNDLDSCLALLLKRNKLPEAAFFCRSYYPSMMSDVLKRWRDSVSMTNPKCGQALADPNKYDNLFPEYMDTLAMEFYQKHFGYPYYNQLEHIKENTDLCNVDRDMAHERLVAIHMGAWDPRVITPPSGASGLSSLQDSPRRDPRNPDSSDEASYSDEKIRRRDSMDILEEIEREIDNIVLDNNEEDLDSSDETMYLE-