Monarch geneset OGS2.0

DPOGS205727
Transcript: DPOGS205727-TA; 942 bp
Protein: DPOGS205727-PA; 313 aa
Genomic position: DPSCF300476 + 35224-37803
RNAseq coverage: 107x (Rank: top 60%)
Annotation
Heliconius: HMEL017878; 2e-113; 85.46%
Bombyx: BGIBMGA005233-TA; 2e-98; 73.82%
Drosophila: no hit listed
EBI UniRef50: UniRef50_D2A239; 2e-47; 44.31%; Gremlin 1 n=2 Tax=Endopterygota RepID=D2A239_TRICA
NCBI RefSeq: XP_973724.1; 3e-48; 44.31%; PREDICTED: similar to Gremlin-1 precursor (Cysteine knot superfamily 1, BMP antagonist 1) (Increased in high glucose protein 2) (IHG-2) (Down-regulated in Mos-transformed cells protein) (DAN domain family member 2) (Cell proliferation-inducing gene 2 protein) [Tribolium castaneum]
NCBI nr blastp: gi|91081789; 6e-47; 44.31%; PREDICTED: similar to Gremlin-1 precursor (Cysteine knot superfamily 1, BMP antagonist 1) (Increased in high glucose protein 2) (IHG-2) (Down-regulated in Mos-transformed cells protein) (DAN domain family member 2) (Cell proliferation-inducing gene 2 protein) [Tribolium castaneum]
NCBI nr blastx: gi|91081789; 7e-48; 45.41%; PREDICTED: similar to Gremlin-1 precursor (Cysteine knot superfamily 1, BMP antagonist 1) (Increased in high glucose protein 2) (IHG-2) (Down-regulated in Mos-transformed cells protein) (DAN domain family member 2) (Cell proliferation-inducing gene 2 protein) [Tribolium castaneum]
Group
KEGG pathway: mmu:12622; 5e-07
    K01645 (CER1) maps -> Wnt signaling pathway
InterPro domain: [194-308] IPR004133; 1.2e-34; DAN
                 [218-312] IPR006207; 1.7e-13; Cystine knot, C-terminal
Orthology group: MCL17681; Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205727-TA
ATGTTATCAGAAAACTTAAAGAAGGCTCCGGATGCCCAGTTTCTATGGGCATTGTGGGAAGCAGATGGGCGGAAACGTAGTTTTGAGATCAGTAAGGCATCTGCGCCGGCGCCTGTGACAAAACATTGGCTTGTTTGGCGCCTGGCGATAGGGGCGCTCACGCTCATTACCGCTTCGCCGGCCGGTCCTGTGAACACGCATAACGCGTTGCTCACTAGTAGCGGACTTCACCGACGCACTACTGCCTACGATTTGTTGCCTCTTGTCTCTCATTACAACATGGAGAGCTTTTTGAAAAGATGCATCTGGCTCGTTGTTGCGGTGATTTTGGCTGCTGGCAATATAAGTTGTGGTGTTGGAAGCAAAAGGCCTTTTACGCCTACTGATTCAATTTTGGATATTATCGATACAGAACAAGTACAAAGGCTAATTGCGTCTCGACAACGCGCCGAGGAAGCGACGGCTCCTTCAAATTTTGTTCAAAGAAACGAACTTGGTGAAGGCAGCTCTGAGACAATGGTTGTTTCTGTCCCTGACCAAGAGGAATCAACAGAGGTACCAAAAGGAGCTGCATTTCGAAATATACTCCGGTCTTCGAAAAACGCATTACTCGTAACGAAAAAAGAATATCTTAAAGAAGATTGGTGTAAAACCGAGCAGTTAATTCAAAAAATTCGGGAGCCAGGATGTTTACAGGCGACAGTGATTAACAATTTTTGTTACGGGCAATGTAATTCGTTTTATATTCCAAAAGGACCACGGCGTCGCGAAGGAAACGACGAACGACCTCCACCTGCGTTTAAATCTTGTTCATTTTGCAAACCAAAAAAGTTTACCTGGATTACTGTTACGCTTCGCTGCCCTGGACAAAATCCTCCATTTAGACGCAAGCGCTTACAAAAAATCAAACAGTGCAAGTGTCTTCCAGTGGGTGTAAACTGA

Protein sequence:

>DPOGS205727-PA
MLSENLKKAPDAQFLWALWEADGRKRSFEISKASAPAPVTKHWLVWRLAIGALTLITASPAGPVNTHNALLTSSGLHRRTTAYDLLPLVSHYNMESFLKRCIWLVVAVILAAGNISCGVGSKRPFTPTDSILDIIDTEQVQRLIASRQRAEEATAPSNFVQRNELGEGSSETMVVSVPDQEESTEVPKGAAFRNILRSSKNALLVTKKEYLKEDWCKTEQLIQKIREPGCLQATVINNFCYGQCNSFYIPKGPRRREGNDERPPPAFKSCSFCKPKKFTWITVTLRCPGQNPPFRRKRLQKIKQCKCLPVGVN-
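
The transcript and protein lengths reported in this record (942 bp and 313 aa) can be cross-checked with simple arithmetic. The sketch below is only an illustration and assumes that the transcript is a coding sequence with no UTRs that ends in a single stop codon, which matches the ATG start and TGA end of the sequence shown above. The function names are hypothetical and are not part of any MonarchBase tool.

```python
def protein_length(seq: str) -> int:
    """Count residues, ignoring a trailing '-' or '*' stop marker."""
    return len(seq.rstrip("-*"))


def cds_lengths_consistent(transcript_bp: int, protein_aa: int,
                           has_stop: bool = True) -> bool:
    """Return True if the transcript length equals 3 nt per codon.

    Assumption: the transcript contains only the coding sequence,
    optionally followed by one stop codon (has_stop=True).
    """
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


# Values taken from the record: 942 bp transcript, 313 aa protein.
# (313 residues + 1 stop codon) * 3 = 942
print(cds_lengths_consistent(942, 313))
```

For this record the check passes, since (313 + 1) x 3 = 942. The helper protein_length can be applied to the protein FASTA body to obtain the residue count without the trailing stop marker.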