Monarch geneset OGS2.0

DPOGS214111
Transcript | DPOGS214111-TA | 2514 bp
Protein | DPOGS214111-PA | 837 aa
Genomic position | DPSCF300014 - 1881801-1885027
RNAseq coverage | 295x (Rank: top 38%)
Annotation
Heliconius | HMEL011406 | 0.0 | 81.51%
Bombyx | BGIBMGA006161-TA | 0.0 | 67.20%
Drosophila | CG33214-PA | 0.0 | 48.31%
EBI UniRef50 | UniRef50_E2AIP6 | 0.0 | 52.29% | Golgi apparatus protein 1 n=8 Tax=Formicidae RepID=E2AIP6_CAMFO
NCBI RefSeq | XP_001652319.1 | 0.0 | 53.68% | MG-160, putative [Aedes aegypti]
NCBI nr blastp | gi|157114571 | 0.0 | 53.68% | MG-160, putative [Aedes aegypti]
NCBI nr blastx | gi|157114571 | 0.0 | 53.68% | MG-160, putative [Aedes aegypti]
Group
Gene Ontology | GO:0016020 | 3.2e-14 | membrane
KEGG pathway | isc:IscW_ISCW007750 | 0.0
  K06816 (GLG1, ESL1) | maps-> Cell adhesion molecules (CAMs)
InterPro domain | [133-186] IPR001893 | 3.2e-14 | Cysteine-rich Golgi apparatus protein 1 repeat
Orthology group | MCL15033 | Single-copy universal gene

Nucleotide sequence:

>DPOGS214111-TA
ATGTCGCTCCATCCCAACTGCCAATCTGAAATATCGTCACTCAAGGAGATGAAGTATAATACACTTCATTTAGATAAAATGGTATTTGCTGCTTGTAATTTAGATCAGAAAAACTTTTGTCCTGATGAAGTTCCAGATTCATTATTACTGTACAAATGTCTTGTTAGACATAAATATGAAAATGGGATGTCTAAACGTTGCCAGGATCAGTTATTTTATACACAACGAACAATGGTGCAGAACTATAAAATGAGCAAGGGCTTAGTTAAATCCTGCAAGGAGGACATTCGTAAATACCATTGTCGAAAAGGTGTAGTTGAAGATAAAGATGTCCGTCTTGCACAAATTTTATTGTGTTTAGAAAATGTTACCCGCAATGACAGCACAAAGCTCTCTCCCGAATGTGTCGCGGAAATGACAGATCATCGAAAAATGCTAATGGATGATTACAGGCTATCACCAGAATTAATGAAGAATTGTGCAAATGACATAACTATGCTATGTAGAGGTATTGAAACTGGTGGAAAAACAATTCATTGCCTAATGGACCATGCAAGACCGAGGAGGAGAAAAGATAAAAGGATCAGCTTAGCATGTCAAAGGTCATTAGAAATTCTTGTACAAGAAGCCGATCCTGGTGAAGACTGGCGAGTAGATCCGATTTTACGTAAAGCTTGCAAACCAGTTGTAGATACAGCATGCAGAGAGGTCAATGGTGGGAATGGTAGAGTTATGTCTTGTCTTATGGAAAAACTGGGAACTGTTCTCATGACACCTGAATGTGAAGCTGCTTTGATGCAAATACAATATTTTATATCTAGAGATTTCAAGTTAGATCCCCAATTATATAAAGCATGTAAATATGATGCTGTCACCCAGTGCAAAGCTAAATTGAAATGGTCTGATGCAAATGAACATCAATCTGAGAAAGATCCTCTTGTGTTACCATGTCTGTATAACTATGCTTATGACTCTAATCTGAGAGGTATATTAAAACCAGCTTGTGAGCAACAAGTTAAGAGAGTCATGAGACAAAGGGCCGTCAGTGTTGATTTACTGCCTGAAATAGCTGATAATTGCATGGATGACTTAACAAATTTATGTTTTGAAAATACGGGTAAAGGCGAAGAAATATTGTGCTTGCAAAGTAAAATTAAAGATCTTACTCCAAAATGTAAAGATGTTGTTACAAATTTTACTGAAACTCAGAGTGGTCATATAGAGTTAAATGCAGTTGTAAGTATAAACTGTAGAGTTCCTATAGAAAAGTTGTGTTCGTCGGAGCTAAAAAGTAAGAAAGATGAAGATGATATTTTGGAGTGTTTAATTATGCACAAGAATGACGCCGAGATAAAAGTCAATGTCAAATGCAGGGCAGCTATAGAACATGAACAATTGATATCACTTAAGAATTACAGATTTACTAGAAAATTTAAAAACGCATGTAAATCTTATGTTGTTAGATTTTGTCCGAAAGCACAAACAAAATTGCAGGTTGTTATGTGTTTAAGTGAAATTATAAGAAACGATACTATCACGAGACGAAAGCATACTATTTATAAAGAATGTCGTCAGCAGCTGAGAAGTCAACTTTTCCAACAAAAAGAAAATATCGATCTCGATCCTGATCTAAAAGAGGCTTGTAGAAAGGACTTACAAGAATTTTGTCCGACCATACCTCATGGAGAATCAGCTGCTTTAGAGTGTTTACAAACTGCAAAAGTAAAACTAAGCGATGGTTGTAGAAAAGCTTTATTTGTTGTTAGGAAACAGGAATTTGCAGACAATGCTATCGATTACCATTTAGTTAAGAGCTGCAGCGATATGATAGACTTGTACTGTCATAATACTGAACCAACAGTCTTATTAGATTGTCTAAAGGCGCATAGACAGGAAGACGATTTTGATAACAACTGCAAAATTGTTGTCATTAACAGAATGATAGAACAAAATATGGATTATCGATTTAATAATAATTTGCAAAACGCATGTGATGGCGA
TATAAAAAAATATTGTAGTAATGTTATTTTAAATGAACCAAAAGATGTAGAACTTCGTGGAAAAGTTCTGTATTGCTTAAAAGAAAAGTTTAGAGAGTCAAAATTAGAAAAAAAATGTGAAAACGAACTGGCTAACGTCTTGAAAGAACAAGCTTTAAATTATCGCTTAGATCCACTGTTAGGGAAACTTTGCAAGGCTGAAATTCAAACAATATGTGCAGTACCCAATGACTCCATAACAAACTCTGATGGTCAGGTTGAGGAGTGTCTAAAGAATGCCCTATTGAACCATAAAATAGTGTCTGCAGAATGTGCCCAGGAAGTTGTTCAAATTATAGAGGAAACTGAGGTAAATGTAATCGAAAATTTGGGAGATGTCTACACTGAAATATCATCTTCACCGTCTAAGAAATATTTTCTAGTTGTAGGGATATCCATTGTGGGCTTAATTTTTATATTTGGTCTATATTGTGGTCGTATGACAAAAAGGGCTATGTATATAAAGAGAAAATAG

Protein sequence:

>DPOGS214111-PA
MSLHPNCQSEISSLKEMKYNTLHLDKMVFAACNLDQKNFCPDEVPDSLLLYKCLVRHKYENGMSKRCQDQLFYTQRTMVQNYKMSKGLVKSCKEDIRKYHCRKGVVEDKDVRLAQILLCLENVTRNDSTKLSPECVAEMTDHRKMLMDDYRLSPELMKNCANDITMLCRGIETGGKTIHCLMDHARPRRRKDKRISLACQRSLEILVQEADPGEDWRVDPILRKACKPVVDTACREVNGGNGRVMSCLMEKLGTVLMTPECEAALMQIQYFISRDFKLDPQLYKACKYDAVTQCKAKLKWSDANEHQSEKDPLVLPCLYNYAYDSNLRGILKPACEQQVKRVMRQRAVSVDLLPEIADNCMDDLTNLCFENTGKGEEILCLQSKIKDLTPKCKDVVTNFTETQSGHIELNAVVSINCRVPIEKLCSSELKSKKDEDDILECLIMHKNDAEIKVNVKCRAAIEHEQLISLKNYRFTRKFKNACKSYVVRFCPKAQTKLQVVMCLSEIIRNDTITRRKHTIYKECRQQLRSQLFQQKENIDLDPDLKEACRKDLQEFCPTIPHGESAALECLQTAKVKLSDGCRKALFVVRKQEFADNAIDYHLVKSCSDMIDLYCHNTEPTVLLDCLKAHRQEDDFDNNCKIVVINRMIEQNMDYRFNNNLQNACDGDIKKYCSNVILNEPKDVELRGKVLYCLKEKFRESKLEKKCENELANVLKEQALNYRLDPLLGKLCKAEIQTICAVPNDSITNSDGQVEECLKNALLNHKIVSAECAQEVVQIIEETEVNVIENLGDVYTEISSSPSKKYFLVVGISIVGLIFIFGLYCGRMTKRAMYIKRK-