Monarch geneset OGS2.0

DPOGS204961
TranscriptDPOGS204961-TA1878 bp
ProteinDPOGS204961-PA625 aa
Genomic positionDPSCF300160 + 571957-576942
RNAseq coverage468x (Rank: top 26%)
Annotation
HeliconiusHMEL0037370.067.18% 
BombyxBGIBMGA011130-TA0.060.57% 
DrosophilaCG7706-PA7e-11147.81% 
EBI UniRef50UniRef50_UPI00022479C31e-12446.53%UPI00022479C3 related cluster n=3 Tax=unknown RepID=UPI00022479C3
NCBI RefSeqXP_972658.14e-13549.35%PREDICTED: similar to smad nuclear interacting protein [Tribolium castaneum]
NCBI nr blastpgi|910914468e-13449.35%PREDICTED: similar to smad nuclear interacting protein [Tribolium castaneum]
NCBI nr blastxgi|910914469e-14046.86%PREDICTED: similar to smad nuclear interacting protein [Tribolium castaneum]
Group
Gene OntologyGO:00055152.2e-29protein binding
KEGG pathway 
InterPro domain[89-199] IPR0002532.2e-29Forkhead-associated (FHA) domain
[50-195] IPR0089846.5e-28SMAD/FHA domain
Orthology groupMCL12931 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204961-TA
ATGTCTGACAACATAGAAAACCCCGACGTGCCGAGCTCACCCAAAATTGAATTTAAAAAGCCAATACTTTTCGGAAAGATCGGTAAACTACCAAAAAAGTCGAAAGCGGAACCTAGTGCAACAGAGGAAAAGAAAGATGAGGAAAAAAATGAAAATACGACTGAAGGTCACTCTAAAAGTTCTTTACCGCCAGCAGTTTTGCTAAAAGAATTATCAATTCCTATACCATACAAGGAGCCAAAATGGTCTGGATTTTGTCCGGAAGGATCGGACTATGCTTTGGAGGTACTGAAATCGGGTATGATCATGGAAAAAATCGATCTTACGAAAAAAGCCTTCTATGTATTTGGACGTCTTGCAAATTGTGATGTTGTTATGGCACACCCGACAATATCCAGACATCACGCTGTTCTCCAATACAAGGCCTTCGCTAATGACGACGAGCCAGCATCCGGGTGGTATTTATTCGACCTGGGAAGCACCCACGGCACGTTCCTGAACAGGGATAGAATAAAGGAGCAACATTACACGAGGGTCAGGGTGGGACATCAGATTAAATTTGGTTCTAGCACAAGAACTTACATTGTATTGGGTCCAGACTTTGATGCTGACGGTGAATCAGAACTGACAGTCACCGAAATAAGACAAAGGGCGCTCAACATGAAGCTGGAGAGAGACAGAATGATAAGAGAAGCCATAGAGCAGAGGGAGAGGGATAGAGTGGAGGAAGAAAGGAGGAGGGAGGAACAGGGAATTGACTGGGGGATGGGCGAGGACGCTGATGATGAACCGGATCTGTCAGAGAACCCATACGCCTGTACAGCAAACGAGGAGTTGTTCCTGGATGATCCAAAGAAGACACTAAGAGGTTACTTCGAGAGGGAGGGTTTAGAACTGGTGTACGACTGTGATGAACGAGGAATTGGCCAGTTTCTGTGCAGAGTGGAGCTCCCGCTAGACGACGCCAGAGGCAGGCCGCTTGTAGCGGAAGTGCTTCACAAAGGAAAGAAAAAAGAGGCTGTGGTGGCTTGCGCTCTAGAAGCCTGCAGGATACTGGACCGAGCTGGGTTGCTACGACAAGCCAAACATGAGTCCCGCCGTAAGAAACAGCGTGACTGGTCGGCGGACGACTACTACGACTCCGACGATGACACCTTCTTGGACAGGACCGGGAGTGTGGAGAAGAAGAGACAGGCCAGGATGGAGAAGAACGGACTGAAGGACACTGAGAAACCACTCACATACGAGGATCTGCTCAAACAGATAACGGACATTGAGAACAAAATAGCATCAGAAGAGAAGATTCTAGAAGCTCTGCGAGTGAAGAGCAAGCAGAGTGAGCTGGTCGACCACGAAGAGGATGCCTTGGACGAGTTCATGAATACTCTGCACACGGGACACAGCATGGCTCATAAGGCTGAGATATCCAAAGCCAAGATGAGCATACAGAAGCTAAAAACCGATCTGTCAAAAACCCGTCGCCTGTGCGAACTGGCTCGCCCCGCGGACGCTCCTCCCCTCCTCAAGAAGGACAGCACACCCGCCATTAAACAGACACACGCAGTCACATACGGCAAGAGGATACGGTTAAAAGACGACAAACCGAAGCCAAAGATCATCAAACAGAGCAAGCGAGAAGAGGAGTTCGTTGAGGAAATGGACTCCGACGAAGATAGTGAATCAAAACCCACACCCATCGTGGAAACTGAAAGCAAATCTGATAGTCCAGTCAGAAGAGACAGCGATGGCACCGTGGCTGTGGAGACGAAGAAATTGTATGGTCCGATGAGGCCGCCGGAGAATTATGTTGTACCCGAAAATTATTACGACGAAGCAACTGACAGGGACCTGCCGGAAATAGAAGAAGGAGTTGAATAA

Protein sequence:

>DPOGS204961-PA
MSDNIENPDVPSSPKIEFKKPILFGKIGKLPKKSKAEPSATEEKKDEEKNENTTEGHSKSSLPPAVLLKELSIPIPYKEPKWSGFCPEGSDYALEVLKSGMIMEKIDLTKKAFYVFGRLANCDVVMAHPTISRHHAVLQYKAFANDDEPASGWYLFDLGSTHGTFLNRDRIKEQHYTRVRVGHQIKFGSSTRTYIVLGPDFDADGESELTVTEIRQRALNMKLERDRMIREAIEQRERDRVEEERRREEQGIDWGMGEDADDEPDLSENPYACTANEELFLDDPKKTLRGYFEREGLELVYDCDERGIGQFLCRVELPLDDARGRPLVAEVLHKGKKKEAVVACALEACRILDRAGLLRQAKHESRRKKQRDWSADDYYDSDDDTFLDRTGSVEKKRQARMEKNGLKDTEKPLTYEDLLKQITDIENKIASEEKILEALRVKSKQSELVDHEEDALDEFMNTLHTGHSMAHKAEISKAKMSIQKLKTDLSKTRRLCELARPADAPPLLKKDSTPAIKQTHAVTYGKRIRLKDDKPKPKIIKQSKREEEFVEEMDSDEDSESKPTPIVETESKSDSPVRRDSDGTVAVETKKLYGPMRPPENYVVPENYYDEATDRDLPEIEEGVE-