Monarch geneset OGS2.0

DPOGS215750
TranscriptDPOGS215750-TA1659 bp
ProteinDPOGS215750-PA552 aa
Genomic positionDPSCF300041 + 927092-954039
RNAseq coverage263x (Rank: top 41%)
Annotation
HeliconiusHMEL0040830.081.84% 
BombyxBGIBMGA003625-TA3e-8072.58% 
DrosophilaCG13188-PA2e-14057.37% 
EBI UniRef50UniRef50_A1Z8R13e-13857.37%CG13188, isoform A n=21 Tax=Neoptera RepID=A1Z8R1_DROME
NCBI RefSeqXP_308422.43e-14354.00%AGAP007416-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582856956e-14254.00%AGAP007416-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582856957e-14955.34%AGAP007416-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00055151.4e-46protein binding
GO:00063551.1e-31regulation of transcription, DNA-dependent
GO:00056221.1e-31intracellular
KEGG pathway 
InterPro domain[27-256] IPR0089841.4e-46SMAD/FHA domain
[34-231] IPR0011321.1e-31SMAD domain, Dwarfin-type
[97-258] IPR0178552.1e-21SMAD domain-like
Orthology groupMCL14780 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215750-TA
ATGGCATCAATCACGAACTTTCGCGTTGTCGGGACCAACTATTCAGTTTTGAAAGAACACATACAAGAAGTTTTAGACAAATGGACTCAAATCGACGATGAGATATGGGCGAAGGTTATTGTATTTGAAAAGAATCGGAGAGTGGCAAAGGCATACGCTCGTGCGCCTGTGCTTACTATCAACGGTTCTGATGATGGATTCGATGGAATGAGAATTCATAAGTGCCATATCCAAGTAAACAATGATCCTTGTCAACATGGCTCGCGCTTCCTACTCGAGACTGAACCGATCCGAATTGGCCTTTGCGGATTCGACAACCCACATAGAGATCAAAAAACTGAAGAACTTAAAAAACACATAGGACAGGGTGTTAAAATAAAAATGGATGATGCTGGGAACATACTCATCAGACGCTATTCCAAGAGTAGCGTGTTTGTGAAGAGTACTGCTGCCACCAGCAACGAGGAGACCGCGATCGGGCAGGATATAGTCAAGCTGCCTGGTTATTCTCTGGAACAGGAAAAGATATTTAAGCTATTCGATATGAAGAAATTCCAATCCAACGTGAACAGGGAGCTGCGACGTGCCTACCCTGATCGAAGACGTTTGGAGACTCAGTGCCTCAGTGCCGTAGCCTTCGTTAAATCTGACAGCGAACTGCTAGAGTGCCCGATATGGGTGCTGGTCATCAATGTTGTTGCTATGGACATGCTCAAATCAAAGTTGCCGCCAGTTCAGAGACCTATGGACATAAAGAACAGGCCTCGTATCCCAATACCTGACGAGGATCCTTACAGTATTGCTAGCTCCAACGCTAACGGTTCAGGCTCTAGTGGTAGTTCTGGTGGCTACGGACATAATGGACATAACGGGCATAACGGACACAATGGCCATAATGGCCACAATGGCCACAACGGTCACAATGGTCATGGAGTCGTTGCTGCAACAAGAGAGCAACTTATGATGCAAATGGGACAAAGGAGAGGCGATAAGCCTCCAAAACTACCACCTAGGGAAAATGGCTACGGTCCAGCCGACATTCCAAAGGCAAACAGAGTTATGGGGTCTAGTTGGGAAATCCCTCCGGACTATGACGATATTGAAGTTAACGAACGTGTTCCGGCACAGTTCCCCAGAGGAAAATCTGACAAAGGAAAAGATAACAAAAAATACGATGATCCTTACTACTGCGGATTGCGCGCCCGTGTTCCCAACTTCGTGAAGGCTACAGGCAAGCACGTCCTGCCTGCAATGACGGAGCGGCTTACGCTGAAGGAGACACCTGTACCCATCAAGCGTTTTAGCGTCGCTCATCCCCACGGCTTCCCACCCCAAATGCCCTTCGCACACCCAATGCAGCAAGCGCTGTGGCACGCTCGCTCCTACGAAAGCGGAATCGGTGCGTACGAACGTCCCAAAAAAGAGAAACTGATGTATCAATTCTACCCCGACAGGGATCGAAAACTACGACCAATGCGCCTGCTGACTACCGCTGGACTTGGCCATAGCTTCGCATTAGTAACTGTTCTAGGCACCGACACACTAACTAATGGGGAGCATTACGACCCTTACGCGCTGTACGGCCGTCTGCCCGCAGGACGCTTCCCGCCGCCGCCGGCGCCCAACGCGCGACATATGTTCATTGGAGAATGGGACTAA

Protein sequence:

>DPOGS215750-PA
MASITNFRVVGTNYSVLKEHIQEVLDKWTQIDDEIWAKVIVFEKNRRVAKAYARAPVLTINGSDDGFDGMRIHKCHIQVNNDPCQHGSRFLLETEPIRIGLCGFDNPHRDQKTEELKKHIGQGVKIKMDDAGNILIRRYSKSSVFVKSTAATSNEETAIGQDIVKLPGYSLEQEKIFKLFDMKKFQSNVNRELRRAYPDRRRLETQCLSAVAFVKSDSELLECPIWVLVINVVAMDMLKSKLPPVQRPMDIKNRPRIPIPDEDPYSIASSNANGSGSSGSSGGYGHNGHNGHNGHNGHNGHNGHNGHNGHGVVAATREQLMMQMGQRRGDKPPKLPPRENGYGPADIPKANRVMGSSWEIPPDYDDIEVNERVPAQFPRGKSDKGKDNKKYDDPYYCGLRARVPNFVKATGKHVLPAMTERLTLKETPVPIKRFSVAHPHGFPPQMPFAHPMQQALWHARSYESGIGAYERPKKEKLMYQFYPDRDRKLRPMRLLTTAGLGHSFALVTVLGTDTLTNGEHYDPYALYGRLPAGRFPPPPAPNARHMFIGEWD-