Monarch geneset OGS2.0

DPOGS208198
TranscriptDPOGS208198-TA5055 bp
ProteinDPOGS208198-PA1684 aa
Genomic positionDPSCF300179 - 258912-265470
RNAseq coverage804x (Rank: top 16%)
Annotation
HeliconiusHMEL0030610.042.76% 
BombyxBGIBMGA002310-TA0.047.44% 
Drosophila% 
EBI UniRef50UniRef50_E0VA114e-2934.78%Golgin IMH1, putative n=1 Tax=Pediculus humanus corporis RepID=E0VA11_PEDHC
NCBI RefSeqXP_002422955.18e-3034.78%golgin IMH1, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420040562e-2834.78%golgin IMH1, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420040566e-4422.15%golgin IMH1, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00055155.9e-11protein binding
KEGG pathway 
InterPro domain[1-95] IPR0089845.9e-11SMAD/FHA domain
[13-92] IPR0002531e-09Forkhead-associated (FHA) domain
Orthology groupMCL26682 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208198-TA
ATGTCGGGCGGGCGCCTGGTGGTGTTGGATCGCTGGGGGCGCGACGTGAAGTGTTTCCCTCTGAGGGCGGGCTCGGCCAGCATCGGCAGCGACCCTTCCTGCGACGTGCGAGTGTTGTCTCCGGCTGCGCTCACTCTGCACGCCACGCTGGCCGTGCGACCGGCCGGGGCGGTGCTGCGCTCCTACGGACAGACCAGTGTCAACGGAGCCCGCGTCAGTGTTGCGGCGCTCCGGCACGGAGACACGCTCACGCTGGCCGGCCGCCGTCTGCGCTGGGACTATGACCGTCCCGACAGACACCGCCGCGCCGCCTCGCCCGCGCCGCCCCTCACCACTGCCGCCCCCGCTCGCTCGCCAGGTCGCGGTCGTCGCCGCAGCGAGCCCGCGCGCCGGCCCGCGGTTCATTCTCTCCACAGGGACTCCGCCCCCGCCTCAGCGGGCAGGAAGCAGGTGGCGATAGTGCAGCCGCAGAGAAGAGACGCCAGCGACAATAATGAGAGCCCCGAGGGATCGTCCGTCAGCTCGGGTCCGTCCCCGCGCGGCCGCCGACCTGACGACGCTCACGTCGGAATACGCAAGTCCGGCCAGAGCAAGTCTCCTAAGCCGCTGGGTTCGGACGACACCACGAAGGCGACCCTGTGGATCGAGTCCCGCAAGTCTCGCTCTCCTCGTAACTCTCCTAAACCTAGCCAGGAGCCGCCGCCGAGGAACTCTCAGAGGAACGCGTGTAAGCCCGGCGAGAGCCCGCGGCTGTCTCGTAAGAGATCCTCCACTGCGCCGACCGTTTCTCCCAGAAGCGTGTCCAAGGTCCCGCGGCTCTCTGCCCGGGGGACTCCTCGCGTTTCCACCAAGAGCGCTCCTCTCCGACTGGCGGTACTGAAGAGGGCGCACTCCGCACAGTACAGGGTCACCAAGATACAGGCGCCGCTCAAAATAGACCACACCAAGCAGGCGGCCATCATGTTAATGACGGGTCACAGTCCTCGGCCTCACGCGCGAGGACTCTCCTCGCCTTCGCCCTCGCCTCACTCACCGCGCGTTCTCAACGTCCGAGAGACTCCTTCGATCACTCCTAGACGTGCTGCCGCCTACAGGACCCCTACCGGGAAGGTAAAAGAAACTGTTAAGAAAACCTCGTCCGGGGGGCGGCGGGCCGCGAGCTATGATGTGCGGTCGCCGACTTTTGCCAGTCCAAAGAAATCTGCTATGAAAGATCCAAAGAAAAAGAAGAATTTGAGAAAGACGGAATCGATAAAGTTTGATTTGAGCAACTTAGACAATTGCAGTAACTTGAGTAACTTAGAAAACGAGGAGAGACACATAAATTCTGATATTATTTTGGTCGAGAGTGTTGATGATCGGTCTCAAGACAGCGTCGCCGAAGACGATCTCACTCTACGTTATTCGGATACCTCGAGCACGAAATCACCCTCCCCGAAAAAAATTATACATTCGAGAAGTAGTCGGATTATTGAAAAAACTCTGGGATCTCCCATGACGACTACTGTATCTGAATCTAGTGCGTTGACGAAACAGTCGCCGCGGTCCAAAAAATCTTTTAGGGGAAGTATTATCGTTCAGAAGGCGTTGGACGAATCGAACTCACGATATTCGAGTCGAACGACAAAGAGTATTTCAGACGAGAGCTACAGCACAACCATAACCTCTCTCATGACGGACGGCCTCAGTCCGAGCTCACATCGAGATATAGAAACGTATTCCATAGTAGACCTGGTGACGATAGATTCAAATGGCTCCGGGTCATCTATATACAACTCTGTTGGCTCTGATAGTAAAGCATCATTCGGAACTCCTCGAACAGTCGCCACCAGAAGGACGAGATCTACTAATCCTTCCCTGTTGGGTTCGAGTACGCCTTACACGAAGAAAACGAATAATCGTGAAGTTACAGACAAGACTGCGTCCATAACATCTTCTACGAGCAAGAGATCGACATCTATAACCACTCCAGAAAATACCCAGAACATATCGTTCAATAGTACGAGGAAGTCGCGAGCCTCACGGAGCAGGTCCCGTATTAATGACAGCGATGTACTTCTCGTGGTGGAAGACGAAAACGAAGATTCCTCTCCTAAATCTTCAAGGCGTTCTAATAGAAGTATGTTGAGTCCAATAGTAAGGAAGAATATTTCTATTACAGTGTCGCCCTCCGATAGTCCCACTCCGGGTACTATAACTCCAGAAAACAGGTACAGTCCACAAGATGTCGGGACTCCAATTCTAAGCATTCAAAGCCTTTTAGAGCAGAGTGCTGTCGATCAGACATCCAAATCCTTGAAGAACTTTAAAAAATCGAAGAGAAAAACCACAGGAGCACTCTTCACGCGACCGCGAACATCTAGACTCAGCGTTAAATCTAAATCGTTAAACTTGACTCAAAGGAGGTCTTTGCGAGCCAGAAGAGCTTCTTGGATTTCGTTTAATGCAATCGGCACATCTCAAGATGCCGAACAGGCGACCACACCTAAGAGTGCAGTCAAATTGATACAAGAAGGAGTAAAAAATAAACATTCAACGGCCAAGAAGCCCCAATCAAAACGCTCTCTTATAGACGATCTTGACGACTCTGACTTAGTAAAGGAATTGTTTAACAGCCCAGTCAAGCGGAAACTTTCCCAAAGCATGACCGAGTTTTCAAAGAAGCAGTTGTTCGACGATGATGTTCTGCCTGCCAAGCAAACCAGAAACACTATAGCGGGGGCTGGCAGAACTCCCGACGGTTCATTCATCGATCAATCTCAAGCGATCACTCCGGAACTATTCGTCAGTCCACTGAGCACGCCAAGTGACAGTCCCAACCTCGTCGGAATTAAGAGATTGTTTGCAAGGAATACACCAGATAATGACTTGCGTAACGTTCGAGGGGTTAAAAATATTCTACGAACCCCTAGGGCAAGGAAGTCTATCAACAATGACCTATGTAACGTATCTGGAGTCAAAAAAATATTCGGTAAATCACCGAGGAATAGATTGAGCGACGTGAGAGTCAAAGAAGTTTTCGCCCAATCACCGAATGACGACTTGAGAAGAATTTCGGTCGTGAAGACTTTATTCCAAACTTCAACAAACGCATTAGAGGACGTTCGAGGAGTAAAAGACCTTTACAGAAAGAGTCCTCGAAACGACCTGCGTGACATATCTGGTGTGAAGAACACTATGAGAGCCAATTCGCCGAGAAATAATTTGTCGGACATGAGAGGCGTGAAGCAACTGTATCGGGAACAGTACTCCAGGAACAATATCAGTGATGTGAGCGGAGTCGAGGAACTGTTCCACGAGTCTGAGACTCTCGACACAACCTTCGACCAGCTACTGGGGCGACCTCGAGTGCGGGAGTACACCAAGGCCAACAGTTGCAGCAAAATCGACAAACGAAAAAAGAACCAAACGCGATCCGCGAAGTCTCTACACGACTCCATCGGACCGATCACCGACAATGTGGAGGCCTGGCTCGAGAGTGAGCTGAAGAAGCGTGCTCGTGCCACGGAAACAGAAGCTTCCAAATCTGCCAGGGAACTACGGAAACTGACCATAGACACTGTTGAAGGACGAACGCCACTCGCGTCTTCCAGGAGCCGTAATTCAATGTCAATAAAAGAACAGTCTGGTGAACGTCAAAAGTCCGCTTCAGAGCTATACAGCGCCCGCAAGTTGCCCATAAAGAAGAGGTCGCTGGTGGCGCGCGACGATTCGGCGGGAGGCGACAGCCGGCCGGGCGGCGAGCTGCTGCCTCTCAAGAAGCGGCCCGTTCTGCACTCCACGCCCGTCAAAGGTCGAGAGCACACTCTCAACGCGTCTGAGTTAGGACGAGTTTCACCCATAGCGCTCGACGACACACGAACATTGCAGTCTAATACAGAAGCGCCGAATCCGAAGAACCTCAGGGTGAGGCTGCCCCAGGCGGCGGAGGAAGATTTGGGAAAAGCGTCAACGAAGCGAACTCGGTTTAAAGGTGGTAGTTCAGCGCCGAGTCAGAGACAGGTTGCCGCGGAGAAGAGAGAAGTCACGAGCCCAAAGGTCGTGAGGTCTACACGACAGAAAAAACACGTGGTAGGGACACCGAAGAAGACTAGAGCTAGAGTGGTGGAGGTTACTGTCGTGGTGACTAAACCATCACCCGTCAAGAAACAAATAAAGCGACAGCGTAGCGATAAAAACAAACCTGCAAACATTGAAGAAGAAAAGCCTAAAAGAACGAGAAATGCGAATCTTAAGAAAGAGACTTTGAAGGAAACCAAATCTACGAAAGTCGTTAAGAATAATATAAACGAAACGAAGACTAAAAAGGCAAAGAGATCTGTCGAGGTGAAAGAGAGCGTACAAGTAGAAACCACTGGACCGAAACGACGAAGAAAGGCAGCTCCCGAAAACAATGCTAAAGTAGAAAATGAAGCAGTAGAGGGCCCTGTGAGGAGGCGGGGGCGGAAACCAAAAGATACAGGAAGTGAAAGAGTGACACGAAATATTAAGAGGACTCTAAAGGAACAATGTTCTGACACTGAAAATGTGCCAAAGAAAAGTACCAGAAATAGAAAAGTTACGGCAGTAGACACAGATAGAGATACAAAAGGAAAAAAAGTTGTTATAGTAGCACCGAGCTCAAGGACGAGGAAACGATCAGAGAAGAAAGTAGAACATAGTGACGGAGATACAGGAGTTAGGAGAAGTAGGAGAGGGATTAAAGATATTGATGATACGAAGAAGAGTGACGCTAAGACAAATGAACAAAAAGAGACTAGAGACAAGAAGACTACAACGAGGAGAGCGGCCGAGAGTGATGGAGAGAAACGAGGGAGAACCACCAGGTCGACGGTTGCCGGGGACGGAAATAAACATGACACTAAGAAGAAGGCAGCGGCCCCGGGTCAGGAGCCGCGCCGCAAGCGACACGCGGGGCAAGACGACACCCGGGAAGAACCGTCCGTCGGCAGTAAGAGACGCCGCGCCGCCCGAGTCGCAGCTCCAGGTATAATGACACAACAACACTTCATAAAACCGTTTTATTTAAAATGTCACAACTCACCACCAGTATGA

Protein sequence:

>DPOGS208198-PA
MSGGRLVVLDRWGRDVKCFPLRAGSASIGSDPSCDVRVLSPAALTLHATLAVRPAGAVLRSYGQTSVNGARVSVAALRHGDTLTLAGRRLRWDYDRPDRHRRAASPAPPLTTAAPARSPGRGRRRSEPARRPAVHSLHRDSAPASAGRKQVAIVQPQRRDASDNNESPEGSSVSSGPSPRGRRPDDAHVGIRKSGQSKSPKPLGSDDTTKATLWIESRKSRSPRNSPKPSQEPPPRNSQRNACKPGESPRLSRKRSSTAPTVSPRSVSKVPRLSARGTPRVSTKSAPLRLAVLKRAHSAQYRVTKIQAPLKIDHTKQAAIMLMTGHSPRPHARGLSSPSPSPHSPRVLNVRETPSITPRRAAAYRTPTGKVKETVKKTSSGGRRAASYDVRSPTFASPKKSAMKDPKKKKNLRKTESIKFDLSNLDNCSNLSNLENEERHINSDIILVESVDDRSQDSVAEDDLTLRYSDTSSTKSPSPKKIIHSRSSRIIEKTLGSPMTTTVSESSALTKQSPRSKKSFRGSIIVQKALDESNSRYSSRTTKSISDESYSTTITSLMTDGLSPSSHRDIETYSIVDLVTIDSNGSGSSIYNSVGSDSKASFGTPRTVATRRTRSTNPSLLGSSTPYTKKTNNREVTDKTASITSSTSKRSTSITTPENTQNISFNSTRKSRASRSRSRINDSDVLLVVEDENEDSSPKSSRRSNRSMLSPIVRKNISITVSPSDSPTPGTITPENRYSPQDVGTPILSIQSLLEQSAVDQTSKSLKNFKKSKRKTTGALFTRPRTSRLSVKSKSLNLTQRRSLRARRASWISFNAIGTSQDAEQATTPKSAVKLIQEGVKNKHSTAKKPQSKRSLIDDLDDSDLVKELFNSPVKRKLSQSMTEFSKKQLFDDDVLPAKQTRNTIAGAGRTPDGSFIDQSQAITPELFVSPLSTPSDSPNLVGIKRLFARNTPDNDLRNVRGVKNILRTPRARKSINNDLCNVSGVKKIFGKSPRNRLSDVRVKEVFAQSPNDDLRRISVVKTLFQTSTNALEDVRGVKDLYRKSPRNDLRDISGVKNTMRANSPRNNLSDMRGVKQLYREQYSRNNISDVSGVEELFHESETLDTTFDQLLGRPRVREYTKANSCSKIDKRKKNQTRSAKSLHDSIGPITDNVEAWLESELKKRARATETEASKSARELRKLTIDTVEGRTPLASSRSRNSMSIKEQSGERQKSASELYSARKLPIKKRSLVARDDSAGGDSRPGGELLPLKKRPVLHSTPVKGREHTLNASELGRVSPIALDDTRTLQSNTEAPNPKNLRVRLPQAAEEDLGKASTKRTRFKGGSSAPSQRQVAAEKREVTSPKVVRSTRQKKHVVGTPKKTRARVVEVTVVVTKPSPVKKQIKRQRSDKNKPANIEEEKPKRTRNANLKKETLKETKSTKVVKNNINETKTKKAKRSVEVKESVQVETTGPKRRRKAAPENNAKVENEAVEGPVRRRGRKPKDTGSERVTRNIKRTLKEQCSDTENVPKKSTRNRKVTAVDTDRDTKGKKVVIVAPSSRTRKRSEKKVEHSDGDTGVRRSRRGIKDIDDTKKSDAKTNEQKETRDKKTTTRRAAESDGEKRGRTTRSTVAGDGNKHDTKKKAAAPGQEPRRKRHAGQDDTREEPSVGSKRRRAARVAAPGIMTQQHFIKPFYLKCHNSPPV-