Monarch geneset OGS2.0

DPOGS209971
Transcript        DPOGS209971-TA  3696 bp
Protein           DPOGS209971-PA  1231 aa
Genomic position  DPSCF300148 + 95105-112180
RNAseq coverage   400x (Rank: top 30%)
Annotation
Heliconius        HMEL013559        4e-165  66.67%
Bombyx            BGIBMGA011331-TA  0.0     83.22%
Drosophila        Liprin-alpha-PA   0.0     55.26%
EBI UniRef50      UniRef50_Q7PKP8   0.0     58.15%  AGAP001469-PA n=9 Tax=Coelomata RepID=Q7PKP8_ANOGA
NCBI RefSeq       XP_002432747.1    0.0     57.83%  Liprin-alpha, putative [Pediculus humanus corporis]
NCBI nr blastp    gi|332028436      0.0     58.54%  Liprin-alpha-2 [Acromyrmex echinatior]
NCBI nr blastx    gi|242024665      0.0     58.00%  Liprin-alpha, putative [Pediculus humanus corporis]
Group
Gene Ontology     GO:0005515   1.5e-11  protein binding
KEGG pathway
InterPro domain   [967-1047]   IPR010993  1.5e-11  Sterile alpha motif homology
                  [1144-1213]  IPR011510  1.7e-10  Sterile alpha motif, type 2
                  [970-1039]   IPR001660  8.9e-10  Sterile alpha motif domain
                  [1060-1121]  IPR021129  1.5e-09  Sterile alpha motif, type 1
                  [1066-1121]  IPR013761  1.6e-06  Sterile alpha motif-type
Orthology group   MCL10462     Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209971-TA
ATGGTGTTTGTGCAAATCGCCTGGTGCTGTAATTCTCATATCGCCATTAAATCGTTCGAGGAGCATGGTGATAAGGACGATAAAGTAGTATGGTGTGAGGGTGAGAGGGTGAGTGTGGGTGTGGAGGAGGGATGTTCGGAAGGCATGCAGCGCGGCGGGGGCGGCGCCTGCGCATTGCTGGGGGCGAAATGCACGCGCAGGAACGGGGAGCTCTGCACGTGCACCGTGCACACAACCCTACTGCAGGAATTCGCGACCCTCACGAAGGAGCTCAACCAGGCCCGGGAACAGCTCCTGGAACGGGAGGAGGAAATTTCGGAACTCAAGGCGGAAAGAAACAATACCAGATTGTTACTGGAGCACTTAGAATGTCTGGTGTCGCGGCACGAGCGCTCTTTGCGCATGACGGTGGTGAAGCGGCAGGCCGCCGCCCAGTCTGGGGTCTCCTCGGAAGTGGAGGTCCTGAAGGCCCTCAAGAGTCTCTTCGAACACCACAAGGCCTTAGACGAGAAGGTTCGAGAAAGATTAAGAGTGGCGCTAGAGAGGAACACGGCTCTGGAAGAGGAGTTGGCTCTGACCAAAGAAGAGTTGCAGCAATATAAATCATCCGGTGCCCAGGAAGGCGACAAGCCCAAGGAGAATGGCACCACCGCCACCGGCTCGCCCGAACAGAACGGGGAACCCCCCAAGGAACAAAGTAGTGTTAACGGCGAAACGGACGTCAAGAAGCTCACTGAACTACAAAACACTATCGCCAAACAGTCGGCGGAACTGAGCTCGTGGCAGAGACGGGTCGCCGAGCTGAACAACAAGGTCACGGAACTGGAGGAGAGGTTGAGTAAGGGGGAGAAGGAGTTGGTCAAGAAGCAGGAGGAGTGCGCGAAGCTACAGAGGGACCTGAGGGAGAACGTGGCTCAGAAGGAGGATCAGGAGGAACGCATAGCCACATTGGAGAAACGATATCTTAACGCTCAGAGAGAGTCCACCTCCTTACACGACCTGAATGAGAAGCTGGAACAGGAGTTGCAGCATAAGCAGGCACAGCTTAAGCTCCAAGAAGAGAAGATAGCGGCCATAGAGGAGAAGCTGGAACTGTCCACGCAGAAGTTGGCGCAGATGTCCTCGCTGCCGGAGATGGAGGAACAGCTCAAGGCGAGGATGGAGGCCCTCAGCCAGGCGCAGGAGAGGCACGGCTCGGCCGAGGACCGCATCCAGAGGCTGGAGGCGAGTGTGGAGGAGAAGAACGCCGAGCTCATGAGACTCAACCAGCGGCTGCGGATGAACGAGGAACACAACACCAGGCTCTCCGCCACCGTGGACAAGCTGCTGTCCGAGTCTAACGATAGGTTGCAGGTCCACCTGAAAGAGCGGATGCACGCTCTGGACGAGAAGAATGCCCTCACCCAGGAGTTGGAGAAAACGAGAAAGTACGCTGATGAACTGTTACAAGAAAAGGCGGACATACTCAAGGAGCTGGCCAAGTGGAGGATGGAGACGGAACAGTTGAAGCGCCAGATGTTGCAGGCGGAGATAGCGTTCAACATCCAGCAGACGGATGCCCTGACGAGGTCGCTGTCACCAGCTGCCGCTCAGCCGCCCACCAACATATATGGAGCCAAGCTGGACGGGTCCTGGGAGAAGCTGCAACAGGCGCACGTGCTGGGTCCACCCTTCGACACCAGTGACGCCGAGAACGAGGAGTCTGAAGGTGCAGAAGGGCACACGGATGCGGCGGCGCTGGCGCTGATGCTGCAGGAACAGCTGGACGCCATCAACACGGAGATCAGACTCATACAGGAGGAGAAGCAGAGCACTGAGGCGCGAGCCGAGGAGCTGGAGTCCAGGGTGAGCTACGACCTAGACACGGATACAATCCAAACATTCATTGACAACAGCGAGGGTGGCGGTCTGTCTAAACTGTTGCAGCAGTGCACGGTCGCTATGCCAGGACCACTAGTCGGCAGTTACGAGCACATGAACGTAGTGTCTCGGCGGGCGGA
GTCCCCGCCGCCGGCAGCCTCGCCCTCCAGACCGAACCATCACAAGTACCACACTGCGCCGGCGTCCATGTCTCCGGCTCACGCTCACTTCCGGCCGCCGGCCTCCGAGAGCCTGCCCTCCAGCCAGGTCGGTCCACCCCCCACCCACGCCCCACCAATCCACTCACATATCAACTCACCCTGTACTAAGCGTTCACCCCTTGTATACAAATCGCTAGTCGCTCGGAGCAGTCGTAACAGCTTTCATAGTATAGGGCATTTGCGATTCGCTTTCTCTTGTCCATATTTACCAGATCGCTGGGGAGGTTTGCAGCCCCTTGTTCTCCTCGCTCCAAGCTTGACTGTGTCTGATCCCGGTGCTAGGAGTAGTTCCAACCTTCCCCCCCACCCCGCTTCCAGTATCGCCTTACGTCCGAGCACACACGACTGTGATGTATATAGGGTGTTCCCCCAGTTGCAGCTGTGCGGGGAGGGAGACGAGCGTGCTGGGCGTCTGGAGCGGGCCATGCATCCCGACAGGGAGAGACTCCGCCCTCCACACCACTACGACTCCTCGAGCCAAGAGTCTCTGGGTGCGGGCGCGTGGGGAGGATCTCCACTCCCGCGAGGAGTGGCTTCAGCATCAGCCGTCTCTATAGCATCGATGCACCAACAGAAGAAGAGAGGAATCAAAAGCTCGTTGGGACGGTTCTTCAGTAAGAAGGAGAAAGCTGGCATGCCGATGCAGCAGGGTCAGAGTCCTCGGTCGTTGTCGTCCGCCTCCTCTCTAGGTCTGTCCTCGTTGGCTGATGAAGGTGAACCCTCGCATCCTCCCACACACCAGCACCCGGACTACGCTCGCACTAAAACTAAAGAGCGCGACTATCGCCATGAACTCCTGGGCGAGGCGATGAGAGCGGGGACGCCCTTCGCGCTCTGGAACGGTCCCACGGTAGTGGCTTGGCTGGAGCTCTGGGTGGGGATGCCGGCCTGGTACGTTGCGGCCTGCAGGGCCAACGTCAAGTCGGGCGCCATCATGTCCGCGCTGTCCGACCAGGAGATACAGCGGGAGATAGGCATCAGCAACCCTCTCCACCGGCTGAAGCTGCGCCTGGCCATACAGGAGATGGTATCATTGACCTCACCATCAGCCCCGCGAGGTACCGCTTGTGCCGCGCTGGCGTTCGGGGACATGAACCACGAGTGGATCGGCAACTCGTGGCTGCCGTCGCTCGGCCTGCCGCAGTACAGGACCACCTTCATGGAGTGCCTGGTGGACGCACGAATGCTGGAACACCTCACCAAGAGAGACCTGCGGACGCAGCTCAAGATGGTTGACAGTTTCCACAGAACTTCGCTTCACTTCGGCGTGGCATGTCTGAAGAGGGTCGGCTACTCCGTGAGGGCGCTGGAGGAACGTCGTCGGGCAGCGGAAATAGGGCTCCGGGACGTGCTCGTGTGGACCAACGAACGTCTCCAAAGGTGGCTGCTCTCCATCAACCTGAAGGAATACGCCAACAATCTGTCGGAGTCTGGAGTGCACGGAGCGTTGATAGCGCTCGACGACAACTTCGACGCGAACAGTATGGCGCTCGCGCTACAGATACCGACCCAGAACACGCAGGCGCGCCAGATCCTCGAAATGGAGTTCAACAACCTGCTGAGTACAGGCACGGAGCGTACGCGCGCACCTCACGACCACGCGCCGGCCTCCTGA

Protein sequence:

>DPOGS209971-PA
MVFVQIAWCCNSHIAIKSFEEHGDKDDKVVWCEGERVSVGVEEGCSEGMQRGGGGACALLGAKCTRRNGELCTCTVHTTLLQEFATLTKELNQAREQLLEREEEISELKAERNNTRLLLEHLECLVSRHERSLRMTVVKRQAAAQSGVSSEVEVLKALKSLFEHHKALDEKVRERLRVALERNTALEEELALTKEELQQYKSSGAQEGDKPKENGTTATGSPEQNGEPPKEQSSVNGETDVKKLTELQNTIAKQSAELSSWQRRVAELNNKVTELEERLSKGEKELVKKQEECAKLQRDLRENVAQKEDQEERIATLEKRYLNAQRESTSLHDLNEKLEQELQHKQAQLKLQEEKIAAIEEKLELSTQKLAQMSSLPEMEEQLKARMEALSQAQERHGSAEDRIQRLEASVEEKNAELMRLNQRLRMNEEHNTRLSATVDKLLSESNDRLQVHLKERMHALDEKNALTQELEKTRKYADELLQEKADILKELAKWRMETEQLKRQMLQAEIAFNIQQTDALTRSLSPAAAQPPTNIYGAKLDGSWEKLQQAHVLGPPFDTSDAENEESEGAEGHTDAAALALMLQEQLDAINTEIRLIQEEKQSTEARAEELESRVSYDLDTDTIQTFIDNSEGGGLSKLLQQCTVAMPGPLVGSYEHMNVVSRRAESPPPAASPSRPNHHKYHTAPASMSPAHAHFRPPASESLPSSQVGPPPTHAPPIHSHINSPCTKRSPLVYKSLVARSSRNSFHSIGHLRFAFSCPYLPDRWGGLQPLVLLAPSLTVSDPGARSSSNLPPHPASSIALRPSTHDCDVYRVFPQLQLCGEGDERAGRLERAMHPDRERLRPPHHYDSSSQESLGAGAWGGSPLPRGVASASAVSIASMHQQKKRGIKSSLGRFFSKKEKAGMPMQQGQSPRSLSSASSLGLSSLADEGEPSHPPTHQHPDYARTKTKERDYRHELLGEAMRAGTPFALWNGPTVVAWLELWVGMPAWYVAACRANVKSGAIMSALSDQEIQREIGISNPLHRLKLRLAIQEMVSLTSPSAPRGTACAALAFGDMNHEWIGNSWLPSLGLPQYRTTFMECLVDARMLEHLTKRDLRTQLKMVDSFHRTSLHFGVACLKRVGYSVRALEERRRAAEIGLRDVLVWTNERLQRWLLSINLKEYANNLSESGVHGALIALDDNFDANSMALALQIPTQNTQARQILEMEFNNLLSTGTERTRAPHDHAPAS-