Monarch geneset OGS2.0

DPOGS201668
TranscriptDPOGS201668-TA2451 bp
ProteinDPOGS201668-PA816 aa
Genomic positionDPSCF300103 - 346695-352941
RNAseq coverage531x (Rank: top 24%)
Annotation
HeliconiusHMEL0035890.074.04% 
BombyxBGIBMGA005348-TA0.066.29% 
Drosophilasog-PA4e-16237.87% 
EBI UniRef50UniRef50_Q240257e-16037.87%Dorsal-ventral patterning protein Sog n=15 Tax=Diptera RepID=SOG_DROME
NCBI RefSeqXP_002044153.15e-16137.87%GM22543 [Drosophila sechellia]
NCBI nr blastpgi|1953553471e-15937.87%GM22543 [Drosophila sechellia]
NCBI nr blastxgi|1700404755e-16037.09%chordin [Culex quinquefasciatus]
Group
Gene OntologyGO:00055153.5e-12protein binding
KEGG pathwaydse:Dsec_GM225432e-160 
 K04657 (CHRD)maps-> TGF-beta signaling pathway
InterPro domain[575-631] IPR0010073.5e-12von Willebrand factor, type C
[35-153] IPR0108954.8e-08CHRD
Orthology groupMCL15716 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201668-TA
ATGATAGCGACGTGGAGACGCCGATCCTCACCAGACATTGTGCAAGATGATGGTCCTATATCTATAATAACCAACAATGAAGACGAGGAATTGAGTATGAGACATTTTGGGGCCTTATTAACTGGTCGATCTCCGCTATGCATCAGACGAGATGATATGACGGGAGTTCCTGTGGCTTCAGGTGTGGCAACTGCAAGGTTTACTTTTCGACGTCGCCATCTATTTTGGTCTGTTATATTAGGGACGTCTGTTAGCATCCAGCCACGTGCCTTGGCTTTTCTTGACAGAGGAGGACGTATGCTACTTGAACATCCAATAAAAAGAGCTACAGGAATACATGCCACTTACGAGGAAAAAACTGATAAGTTGTGTGGTGTCTGGCGTCGAGTGCCTCGTGAATACAAACGAATATTGAGAGAAGGCTCTTTGTATGTCGCTTTGATTTGGGGTGACGAGCGTAACAATTCTATGGAAAGTGCATTGAGTGGACGAATAAATAGATATCCAGCACTAGCGACAGAAATGTTTACGTCTTTGCTGGAACCGGATCATGTTCCAACGTCATTTAGCAGTGGTCCATGTGATGAGTTTCGGGTACCCGGTAAAGAAGAGGGTTGGTGGGGAGGAACTGCTGTTGTAACTGGCGCTGCTGGTGCCGCCCCAAGTTTGCATTTGGCGATTGTATATAATGGTGTCTTCATGCCTGCGGCAAGAGATCAGTCAGTCAGAGTACTATTGACTTTGCCTGATAGAAATCAGACCATTATTGATGAAGTCCAAAAGATAAGTAAACCGGGGTACGAATTGAATGTTCTAGAAGTATCCACACCTGTTTCCGCTGCGGAGTTGCGTTCTTTATCACGTGGAAGATTACTATTGTCAGTAGAAGCAATGGGAACACCAGACAGAAGAATTTCTGGTATTGTCAGACAGAGAGCTGCTTGTGAAGTCTTCCACGCTCCGTTAATCGCGGAAAGATTACCAGCTGCATTGATACCGCAGGGATTGGCCTTAGTTTATATTGACAAAGATGGATCTCTTGTATACGATATACAGGTAGAGAATTTGAGTATAATCGACCCAAAGATCACATTGGTCGAAGAGCAAGGTAAACGTCATTCGCAAGTGGAGATATTAGATACTAGGATGGGAATTCTAGCTCGACCCAGTGCCCGTATCTTCCCTCCACTTTATGAAGATCAGCTTGCAGTACATATTGGTTCTGATACAGGTCCGCCAATCCTGAGAGGTCGTTTACTGTCACGGCCACTGCCAGATGCGGCTAGTGAAGGGCCGTCATTACTACGGCGAACAGATGCTCGAATGCCACCAACTATAAACCCAGTAGCTGGACTTGCTTGGATTGCTATTGACGTTTTGTGCGGATTAAATTATGAGGTTGTTGTAAATGGTTACGCTGGTTCTTGGACTGCATGGCTGGAAGGGAATCCTAATGGACCAAGAATATTACCTGGGATAGAGGGTTCAATTCTGGAACCATCTCCTTCAGAACTACTGGCGTTGAATTCTGGATCAGCACATCTTAATATTCGAACTTCCGACAACGATACTGAGTTTCTACGAACGCGGTTACCCCAGATAAGCGTGCCACCTTCATGTCTACCAGCAGGGTCGTATTCTGACAATGAGTTAAATTCCAACTACGCACAAACACATATGAATTCGCCTCCACCAGACAATTCGTTATCTAACACAGCTTCTTGTTATTATGCCGGAAAGTCCTATGAAGATGGGTCACAGTGGATGGCTGCAGAGTCATGTCACATGTGTGGGTGTGTTCATGGCGCGCTGCGTTGTGACGCGGTCCGCTGTCCCCCGGTCACCTGTGCTGTTCCCACTCTGCGTCCTCCAGGACAATGTTGTCCAATTTGCACCAACTCAACTAAAGCCGTATGGAATGAGTCCCATGGTTGTCACCTTGCTGGACAGTACCACGCACCCGGTTCCTCCTGGCATCCTTACTTAGTACCTGGAGGTTACGACACTTGTGCGATATGTACGTGTGAGTTCGCAACACGACAAGTACGCTGTCCACGTGTTCGATGTCCGCCTTTGAAATGTGCTGAGAAGGATGCCTACCGACCCGATAAGAAGGCTTGCTGTAGAGTTTGTCCCGAAGTAAAAGCGAAAAAGACGGAAGAAGATACACCAAGAGACCAAGGTACGCCTCGGACAGCTGAAGAGATTTTAGCTGAAGGTGGATGCAAGTTTCCGGATGGTCCCTTGCCCAATGGCAAAGAGGTGCATCCATCCATTCACTCCCATGGCGAGCAGAGATGCGTAACTTGCCGGTGCAAGGATGGCGAAGTGACGTGCATTCGTAAGAGATGTTCACGGGCGGCGTGTGCACGGCGGCGACGCGGCGACGCGTGCTGCGCTTGCGCAAGACACCGCCGCCAGCGCGCCCCACCTCCCCCGCCACCAAGTTGA

Protein sequence:

>DPOGS201668-PA
MIATWRRRSSPDIVQDDGPISIITNNEDEELSMRHFGALLTGRSPLCIRRDDMTGVPVASGVATARFTFRRRHLFWSVILGTSVSIQPRALAFLDRGGRMLLEHPIKRATGIHATYEEKTDKLCGVWRRVPREYKRILREGSLYVALIWGDERNNSMESALSGRINRYPALATEMFTSLLEPDHVPTSFSSGPCDEFRVPGKEEGWWGGTAVVTGAAGAAPSLHLAIVYNGVFMPAARDQSVRVLLTLPDRNQTIIDEVQKISKPGYELNVLEVSTPVSAAELRSLSRGRLLLSVEAMGTPDRRISGIVRQRAACEVFHAPLIAERLPAALIPQGLALVYIDKDGSLVYDIQVENLSIIDPKITLVEEQGKRHSQVEILDTRMGILARPSARIFPPLYEDQLAVHIGSDTGPPILRGRLLSRPLPDAASEGPSLLRRTDARMPPTINPVAGLAWIAIDVLCGLNYEVVVNGYAGSWTAWLEGNPNGPRILPGIEGSILEPSPSELLALNSGSAHLNIRTSDNDTEFLRTRLPQISVPPSCLPAGSYSDNELNSNYAQTHMNSPPPDNSLSNTASCYYAGKSYEDGSQWMAAESCHMCGCVHGALRCDAVRCPPVTCAVPTLRPPGQCCPICTNSTKAVWNESHGCHLAGQYHAPGSSWHPYLVPGGYDTCAICTCEFATRQVRCPRVRCPPLKCAEKDAYRPDKKACCRVCPEVKAKKTEEDTPRDQGTPRTAEEILAEGGCKFPDGPLPNGKEVHPSIHSHGEQRCVTCRCKDGEVTCIRKRCSRAACARRRRGDACCACARHRRQRAPPPPPPS-