Monarch geneset OGS2.0

DPOGS213910
TranscriptDPOGS213910-TA1011 bp
ProteinDPOGS213910-PA336 aa
Genomic positionDPSCF300218 - 143039-150323
RNAseq coverage79x (Rank: top 64%)
Annotation
HeliconiusHMEL0060703e-16180.84% 
BombyxBGIBMGA004618-TA1e-17085.71% 
Drosophiladaw-PA2e-6138.37% 
EBI UniRef50UniRef50_D7EL532e-8646.73%Dawdle n=1 Tax=Tribolium castaneum RepID=D7EL53_TRICA
NCBI RefSeqXP_970355.14e-8746.73%PREDICTED: similar to Activin Like Protein at 23B CG16987-PA [Tribolium castaneum]
NCBI nr blastpgi|910950237e-8646.73%PREDICTED: similar to Activin Like Protein at 23B CG16987-PA [Tribolium castaneum]
NCBI nr blastxgi|910950235e-8546.73%PREDICTED: similar to Activin Like Protein at 23B CG16987-PA [Tribolium castaneum]
Group
Gene OntologyGO:00080831.4e-32growth factor activity
GO:00055763.6e-12extracellular region
GO:00400071.9e-09growth
KEGG pathway 
InterPro domain[13-336] IPR0156151.6e-77Transforming growth factor-beta-related
[232-336] IPR0018391.4e-32Transforming growth factor-beta, C-terminal
[92-116] IPR0024053.6e-12Inhibin, alpha subunit
[14-210] IPR0011111.9e-09Transforming growth factor-beta, N-terminal
Orthology groupMCL17344 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213910-TA
ATGGCGTGCGCGAATTGTGCGAACGGTGATCTTAGTGTTACTGAGCTTACGGATGTAAGGATACAGTTCGTTAAACATCAGATCCTGAGGAAGTTGAGGCTGAGCCGGAAGCCGAACGTGAGTGTGCCAGTCAACAGCCTGCCAATGCCAGTCGCCCAGGGACAGACCAGCCAGGGCACTGACAAACCATCGGAGTTCAACGATTACTACGGAAGAACTGAACAGAAAATCATTTTCCCAGTGGAAGGTGAATGTCTCACTTCAGGGCGCTATCCTTTCATATGTCTGCAGTTTGAACTACCTCCAGATGTAGAGCCAGAGGAGGTCACTGTGGTAGAACTCTGGTTCTATAAGGAGCGTGATCCTCTTGATGAATATAACCAGACTTTCGTGATCTCTGAGGCGGCGCATTGGGACAGCCAACAGAAGTTCAAGAAGACCAAACCTATAGCCATCAAAGAAACTGATATTTTGGAAGGTTGGGTCAGAGTGGAGCTGGCTTGGGCGGTGAGAGTTTGGTTAGAAGGCAGAGACAGACTGCACACCCTACACGTAGCCTGTAGGACGTGTCAGCTCCGTCGTGCTCCTCTCTCATTTCACGAGAAGCATCGACCGTTCCTAGTGCTGTACACCAAGTATGCTGGGAAACGTCGTCGAGGGAGGACCCTGGAGTGTGGGGCAAATACCAGCGAGTGTTGTCGGGAGCCGTTATACGTCAGTTTCAAGGAACTTGGCTGGGACGACTGGATCATCAGACCCGAGGGTTACCACGCATACTTCTGTAAAGGGAACTGCGCTCCGATATACGCTGTCTCACAGGCTGACAGCTACCACCATAATATAATCAGAAAATACTTCTATTCGGTGAACGATAATCGTAGAGGGGAGTTCAAGCCTTGCTGTGCGCCGACTACCTTCAGTTCGTTACAACTGCTCTACATGGACTCCAACAACACCGTCACTCAAAAAACTCTACCGAATATGGTCGTAGAGTCCTGTGGTTGTATGTGA

Protein sequence:

>DPOGS213910-PA
MACANCANGDLSVTELTDVRIQFVKHQILRKLRLSRKPNVSVPVNSLPMPVAQGQTSQGTDKPSEFNDYYGRTEQKIIFPVEGECLTSGRYPFICLQFELPPDVEPEEVTVVELWFYKERDPLDEYNQTFVISEAAHWDSQQKFKKTKPIAIKETDILEGWVRVELAWAVRVWLEGRDRLHTLHVACRTCQLRRAPLSFHEKHRPFLVLYTKYAGKRRRGRTLECGANTSECCREPLYVSFKELGWDDWIIRPEGYHAYFCKGNCAPIYAVSQADSYHHNIIRKYFYSVNDNRRGEFKPCCAPTTFSSLQLLYMDSNNTVTQKTLPNMVVESCGCM-