Monarch geneset OGS2.0

DPOGS212541
TranscriptDPOGS212541-TA4299 bp
ProteinDPOGS212541-PA1432 aa
Genomic positionDPSCF300315 + 23488-47998
RNAseq coverage487x (Rank: top 26%)
Annotation
HeliconiusHMEL0145450.059.23% 
BombyxBGIBMGA008130-TA0.043.84% 
DrosophilaCG5004-PA1e-3052.67% 
EBI UniRef50UniRef50_D6WES91e-5648.47%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WES9_TRICA
NCBI RefSeqXP_971453.12e-5748.47%PREDICTED: similar to pleckstrin homology-like domain, family B, member 2 [Tribolium castaneum]
NCBI nr blastpgi|3800281496e-5844.61%PREDICTED: uncharacterized protein LOC100871905 [Apis florea]
NCBI nr blastxgi|3320258016e-5824.12%Pleckstrin-like proteiny-like domain family B member 2 [Acromyrmex echinatior]
Group
Gene OntologyGO:00055152.7e-24protein binding
KEGG pathway 
InterPro domain[30-126] IPR0089842.7e-24SMAD/FHA domain
[29-126] IPR0002534.1e-13Forkhead-associated (FHA) domain
Orthology groupMCL25429 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212541-TA
ATGTCGGGTTTGCCAAATAGGGCGGATGACCCGATAAATGGAGTCGATGTACGGGAGCAAGGAAGCGCTTTGAGGGTCGCCACTAATACGCCTCATCTGGTGAGCCTCGGGACCGGAAGGCTAAGCACGGCTGTGACGCTACATCCCATCAAACAAGGTCGTGTGACCATAGGGTCGGATCCGACCTGCGACGTTTACGTTATAGGTACGGGGGTATCTAACGTACATTGCCGGGTGGAAAACTCCCACGACGTAGTCACGTTGTATCCCATCAGCGGCACCACGTTACTCGATGGCTTGCCCGTTGACAAGCCAACGAGATTATCGCAAGGTTCCATGCTAACGATAGGTAGGTCAAACTACCTTCGTTTCAATCACCCCGAAGAAGCTAAGCTCATGAAGTCTGTCCTGCCATCAGCTCACGTGTCCATGGCTCCAATACAGTTCACACCAAACGAACAATGTCTACCAACTGGCTACGAGAATCACAACTCGGACCCAAGCGACCAATGCTATCAAAACATCCACAGAGTATACAAAAATAATTCTCTAACGCAGTTAGACCGCGAATTAGATGTGACGTTGCGGGAAATGTCCCGGAACAAGCCGCCCGTGGTCCCCCGGAAGATGAATAGAGATCTGGACACGTCAGATACGAGCTCCGACCAAGAAACTAAACCGAAGGCCGGCAGCATCATGGCCAAAGTGTCCAAATTCGAGTACTACGCCAAGCAGCAGAAAAATAACAGCAAGAGTCACTTCTACACCAACGACGTGGAAATATGTCCCAAAGTATTCAGCTCAGACTCGCTGACCGTGAACACGCCGGCCAAGGACGTGCTCGGCGGGAGAAACGTCCCGGTTTATATGAACAAGGTGATGGATCCGAAGGTTATAGTGCTCCACGAGAACAACGCGGAGCAAAAAAATTCGACCAGGAGCAGGAAAATAGACGACATACTGAGGAATTTCGACGACACGTCCAAGTTACCCAAGAAAGTCAACAATGAATCCAAGGATCACATATATGGGAAAATTAATGTGGACAGGAAAGAATCCAAGGGGTCCAGCGACATCAGGCAGCTCAATATGGATAGTGATTACGGCAGGCTGTGCGGGGTGGGGAAGGTCGTGTGTTTGTCTTCGCCGGCCTATGACAGGAACCCGCAGTATTCGCCGGTATACGCCAATTCACACTACGAGAGGAGTCTGGAAGCAGAGAGTATGAGGGCTAAGGTATGTCGTTCGGGGATAGCGATGACTACGACAAAAGGAAAAAACGCTAGCAAACTCATAAAGAACTTCATCAAGAACCTAAAGAAGAACACGTCGGCCATGAGGAGACTGAAACGCGCATACGAGCTGCACTCCAACAAGAACAGCATACACGAGCTGATCAAAATAATTAACGAGAAGCTGGACGTGGTCCACAAAATACGGAACGACATCAAAATGTACAGGACAGTGAACACCGTCCTCACCAACCACTACATCAGGGAGACGAGGGACCGCGAGAGGGACATCGGCGGCAGAATGATAGACCTGGAGGAGCAAATACTACGAGCCTACGAGGGTCTGAGGAGCGACATCATGAAGATCGTTGCCAGCGACAGGAACGTGATAGCAGTGACAAATAACATGTACAGGAATAACGAGGCCAGTGACAAAAAACAGATATACGACGGCATTATTAGGAGAATTGACGGACTGTTCACTGGCGCGTCCAACGTACCGTCGTATTTGGATATTTGTATACACGGGGGTCTGAATATCCTGGTTTGCTTGAGCAAATTTGTAATGTTTGATACTAGTCACAAAAACACAAACAGTCAGCTCCAAAGGAATGTATCTGGTTACGAGAATTTTGAGTCGCCGAGAACCGAACCCGTGACACCGGTCGTCAGCACGGAGAATTACGAGAACATAGGACGAATGGACGAAATCGGCATACCGCTGTACGAAGACGCCAACAAGGAATTGTCCAGTCCGGTCATGATAAGGAAGCTCAGCCAGAACACGTCACCTATAAGCAGTCCCTACGAAAACGTCTATCTGAAGAACAACATGGGCATTAAACCTAAATCACCAAACACGCAGAGCCCCAGAACAAGAATAAAGACGAGTTTCGCACACAAAAACCTGCCATCACCACAATATTTCATCTTCCCGCCGACGGACGCTAAAACAGAAGCCGTTCACAACGAGACAGCCGGCAGCATCAGCTGCAACGGCATCGAGGAGAAATTGAGCTTAGACGATGAGATACCCATGATAGATGACTTGGATATCACCAACGATATAGAACTCAGACATTCCAAGGTCGAGAGGGACGAAGACAAGCTGATGACCAAAAACAAGAGCTTCGACGACGTCAAGAAAGAGTTGATGGCGGACATACCGGAGCTGGAGGAATTCGAGAACGATCTGAACAAGAGGGACAAGATACAGGGTGTGGCAGACGTCTTCAGGTCTATAGATATAGAAGAGAATTCTATAGATAGTGTCATATATGAGAGCGCTGAGGATCTGAAATCTAGATACGAAAATCTTAAAGAGGAGAGGCGAAAACTGATGGCACAGATCCACGACGTTAAGTGCAAAATGACAGAAATACGGTCCCAGGAGGACGACATCTTGAGAGAGCTGGAAATGGAAAAAGCCTTAATAAAAGGTGAATATGATTCAGAGATAGCCATCCTGAATATAGAACAGAAGAAGAAGACAGAGTTACTGGACAGTATTAAGAAGATAGAAGAAGATATACGACTGTTGAAGGACAAACAGGAAGCTCGCCAGAAAGAGATGAGAGATAGAGTTGACATCGCTACCATGAAAGTTGAAAAGCTCTCGAATAGAGAGGACGTACAGAACGAGCTAAGTTTATTGATAAAAAAAATAGATGAGCAAAAAACTCGAATACTAACATTGAAGTCGGAAGCGAACGACAACCTCACGTCCGCTGTGGAGGAAACGAAGGTCCTGCAAGCGGAGTATGTTAAACTACTGAATGAAGTGGAACACCTCACTGGCAAAGTTCAGATCATAGAGCAAGAGTTGAAGCCGATAGTCACCAGACTCAACGAGGACCGGACGCAGTCTCCAGACAGCGCCTTCTACACAGACCAGACACGGGCGAAGTCGGGGGAGTTCGGTTACTCGCCGCAAACATCAGACGACGACGATTTGGTGGACCATGCGAAGCAGCTGAAGGAAAAGTTCACGTCTATCGATCGGATGTCCCAGTCTATGATAGTGGACATAGAGAGACGGGTCGCTGACCTCGACGACCCGGTGACCAGCCCGTACAAGGCCAAAGAGAAGAAAGGCTTCTGGGAGAGGAACTTCGACTCGCTGAAACGGAAGAAGAAAAAAGCGGAGAAGGTGGACCTCATGTCGCAGAGTCTGAACGAAAATATTTTCTACAATGACAACATAGAAGCAGATAACCCGACAGACAATCACAACGGTCTGAAGAACTCGAAACGCAAGAACGTACCGCTGAAGAGCAATTCGTCGTCAAAGATACCAACATTCGCCGCGTTGGGCAAGATCATTAAGAAGGATTCGCTGAAACGAAAAGAGACGCAGAACAAAGAGAACAAGCCGGAGGAGACGTGCAACAGATACGTCAAGAATAACAAGACAACGGACAATAAATACAAGTCCAAGTCACTCTCACCCGACAAGCAACCCGGGATAGAAGAAACTACCTTCAAGGTTGAGAATTTGAAAGTTCTGCACAGGAAGAGCTGTGGAGACGAGCCCACGAGGGTCGACTACCAGAAGATAACGGATTCCGGGAAGATTCCATCCCAGGATGACATCGACAGGATCTCCAAGGTCACCATCGACGCCCCGATACTGCCCAGCGACACAGACGTCAACACCCTCGGCAAGAGAACCCTGGACAGCCTCATGGAGATTGAGAGGAAGAGAATTGAAATGCTCGAGGAAAAAGGCTGTCAAATGATCGAAAACGAACGAGAGAAAATATCCGAATTGAAGAAACGAGTGCAAGACGAAACTAAGAAGCTGTGGGACCAGCGCGCTAAGGACCAGAAACCTGACACGCGGCAGTCCCCTGTCAGCGTGACGGCTGTCAGCCTGACACCTGTCAGCTTGACAGCCGCCAACCTGGCCTGTGAGTTCGACAGCCCTAACAGCACGTTCGGAAACGAAGACGTCAGCTTTGACTACAACATGACCAGCTCTACAATATACGAAGAGAGGCATGGTTATGATAACCGAAGGTCACAGCTAGCCAAAAATCTGATACCACCCAAAAAGCTTCCGAAATAA

Protein sequence:

>DPOGS212541-PA
MSGLPNRADDPINGVDVREQGSALRVATNTPHLVSLGTGRLSTAVTLHPIKQGRVTIGSDPTCDVYVIGTGVSNVHCRVENSHDVVTLYPISGTTLLDGLPVDKPTRLSQGSMLTIGRSNYLRFNHPEEAKLMKSVLPSAHVSMAPIQFTPNEQCLPTGYENHNSDPSDQCYQNIHRVYKNNSLTQLDRELDVTLREMSRNKPPVVPRKMNRDLDTSDTSSDQETKPKAGSIMAKVSKFEYYAKQQKNNSKSHFYTNDVEICPKVFSSDSLTVNTPAKDVLGGRNVPVYMNKVMDPKVIVLHENNAEQKNSTRSRKIDDILRNFDDTSKLPKKVNNESKDHIYGKINVDRKESKGSSDIRQLNMDSDYGRLCGVGKVVCLSSPAYDRNPQYSPVYANSHYERSLEAESMRAKVCRSGIAMTTTKGKNASKLIKNFIKNLKKNTSAMRRLKRAYELHSNKNSIHELIKIINEKLDVVHKIRNDIKMYRTVNTVLTNHYIRETRDRERDIGGRMIDLEEQILRAYEGLRSDIMKIVASDRNVIAVTNNMYRNNEASDKKQIYDGIIRRIDGLFTGASNVPSYLDICIHGGLNILVCLSKFVMFDTSHKNTNSQLQRNVSGYENFESPRTEPVTPVVSTENYENIGRMDEIGIPLYEDANKELSSPVMIRKLSQNTSPISSPYENVYLKNNMGIKPKSPNTQSPRTRIKTSFAHKNLPSPQYFIFPPTDAKTEAVHNETAGSISCNGIEEKLSLDDEIPMIDDLDITNDIELRHSKVERDEDKLMTKNKSFDDVKKELMADIPELEEFENDLNKRDKIQGVADVFRSIDIEENSIDSVIYESAEDLKSRYENLKEERRKLMAQIHDVKCKMTEIRSQEDDILRELEMEKALIKGEYDSEIAILNIEQKKKTELLDSIKKIEEDIRLLKDKQEARQKEMRDRVDIATMKVEKLSNREDVQNELSLLIKKIDEQKTRILTLKSEANDNLTSAVEETKVLQAEYVKLLNEVEHLTGKVQIIEQELKPIVTRLNEDRTQSPDSAFYTDQTRAKSGEFGYSPQTSDDDDLVDHAKQLKEKFTSIDRMSQSMIVDIERRVADLDDPVTSPYKAKEKKGFWERNFDSLKRKKKKAEKVDLMSQSLNENIFYNDNIEADNPTDNHNGLKNSKRKNVPLKSNSSSKIPTFAALGKIIKKDSLKRKETQNKENKPEETCNRYVKNNKTTDNKYKSKSLSPDKQPGIEETTFKVENLKVLHRKSCGDEPTRVDYQKITDSGKIPSQDDIDRISKVTIDAPILPSDTDVNTLGKRTLDSLMEIERKRIEMLEEKGCQMIENEREKISELKKRVQDETKKLWDQRAKDQKPDTRQSPVSVTAVSLTPVSLTAANLACEFDSPNSTFGNEDVSFDYNMTSSTIYEERHGYDNRRSQLAKNLIPPKKLPK-