Monarch geneset OGS2.0

DPOGS203692
Transcript        DPOGS203692-TA  4647 bp
Protein           DPOGS203692-PA  1548 aa
Genomic position  DPSCF300010 - 1972485-1995932
RNAseq coverage   245x (Rank: top 42%)
Annotation
Heliconius      HMEL013312              0.0     64.55%
Bombyx          BGIBMGA003486-TA        0.0     56.44%
Drosophila      bbg-PC                  2e-23   44.20%
EBI UniRef50    UniRef50_UPI0002246D24  5e-38   42.33%  UPI0002246D24 related cluster n=1 Tax=unknown RepID=UPI0002246D24
NCBI RefSeq     XP_001606112.1          5e-39   42.33%  PREDICTED: similar to prIL-16 [Nasonia vitripennis]
NCBI nr blastp  gi|345487742            2e-37   42.33%  PREDICTED: hypothetical protein LOC100122506 [Nasonia vitripennis]
NCBI nr blastx  gi|270014802            7e-116  27.61%  hypothetical protein TcasGA2_TC010783 [Tribolium castaneum]
Group
Gene Ontology    GO:0005515  3.8e-19  protein binding
KEGG pathway     dre:368723  6e-16
                 K06092 (INADL, PATJ) maps-> Tight junction
InterPro domain  [1326-1441]  IPR001478  3.8e-19  PDZ/DHR/GLGF
Orthology group  MCL22032  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203692-TA
ATGCGTCATATGTGTACAATACAGACCAGCCCAGAAGCAGTGATTAGAACTGCAGGGCCAAGTTTGGTATGCGGCATGGAGGCAGACGCGGCTGTCGAAGAGTTTGTCACAATAGTGCCGGTTGGGGAAAATAGTACTGTTGTACAGGATCCAGAATTTGTTACAGTGCTCAACGTCAGTGTGACAAAAGGCGGCGCTGAAGTTATTGTAAACAGACCACGTGGTGCTCGATTAGGTCTAGGACTAAAGTTTGAAGGTGGATCAGCTGCAACAGAGAAAGTTCGAAGACTTTTAGTACAATCATGTGCGGAAGATAGTCCTGCTGCGAATGCGTGTACTCCTTGGGGTAAATTGATATCAGGGGATGAAATTTTAGCTATAGATGGGACTCCAGTATCTGAATTAACAAGAATCGAATGTGTACGTCGTTTAAAAGATTCAGATGAAAGTTTAATTCTTCTTGTCAGACATTTTGAAACATTGGATAAAGTAGATCAGCAAAATATGGAAACAAAAGAAACAGTGATGACTGACTCGAAAACTTTTGTTGATATATCAAGACCTCTGACTTTACCTCCCCCCGTCCCTCCGAGGAAATTGGGTAAGAAAAATTCCTTTAAGGATAAAACAACCCTTCAAACTGTGGGTGAACAGAACGCTCTCATGAATTTAAATCTGTCTGACACTGGTAACGATAAAGATAGTATTATAAGTGCATTAGTCCACCAACATTCGATGGTTTCAACTATGAATAAATTATCACGGAAATCGTCTTTTGATAGCCATATACATCGTCAAACCAAGGAAAGTTTTGCACACTTAAAAAAAGGTTCTCCTGAAGAAGTGAGGCGTCTAGTACGTCGACTATCAGACGGCAAAACGATACCACCAGAAGCTGAAGTTTATATTGATTTACTTTCGAATGAGTGGGAACGCTGTCTTGTATTAGCAGATGCTGAATCAGACGATACTGGTAGTTCTATATCTACTGTAGTCGATAGACTAGGATCTATGGCTAGTTCTGTAGAGAATAGTATTCCTTCAACTCCAGTTATGCAACAGAAATCAATTGATATTCAAAAGGTACTCAATAGCATTGAAAATATAGATTCTGATATATTAAAAGACATGACGAACGTGAAATATTCTGAAACTAAAGTAGATAGAAATGATGATCTTAAAAGATGTTTAAGAAATTCTATGACAAATGGACAAAAAGTGTCTAATACAAATTCAAACATAGTTACAGAAAATATAAAACCAAAACAATGTGAAAACGATAACAATAACAACGAAACATCTAAAACTTTGGCACCAAACGATACGTTTGCGATAAATGACAAGAAAATGAAGGAGGATGTAGTGCTCGATAAACCTAAACCAATGCCACGCAGCACTAAAGTGGAACGTTCTGCAAGTGAAAAGAAACGTCGTCCTGTTCCAATACCAGAAGTACAGCCTCCTAATATTTCGGAACCACTTTTAACACCTACAAAGAAAACATGTATTGAATCGTGGTTACAGCGCTCAGAAGCTGAAATGCAGAATACCTCGACAGAAAAAATAGAGGAAACTACTATTCCTGAAGCATTACCACGTCTTATTGACTTTGTCCCCAAGAATCAGTACAAAGAAAAAAGCAACGAATCTCCAACATGTGTTAGACCTACAACGGCTCCTCCTCCACCACCCGTACCTCCACCAGAAACCAGAGAAGATGGATCGGGAGCATCACTTGATACAATTGGTGAACTTGATGAAACCTGCGACGTTGTAGACAACGCGAAACAAACGAAGGATATTGAAACGAATAAAAGAAAATCACATTACGAGAGAAACTTCTGGGATGATCGAGTTAATAGAAGTGATGCCGAAGACAAAAGCCTTTCTAATGATGAAACTCCATTGCCTTCTTGTCCTCGACAACCTCCCGATGGAGTAGAGACACCTTCTGATGTTGTTGAAAAAGTCCCAGACCTCCCTACAAGATTACCTCCAAC
TACTTGGATAAGTACTCGCTCTAGATACTCATATGCATCTCGGACTCAAGACGCCAGAAGTGAAGCAAGTGTTAAAGATAAAATTGCGATGTTCTCTGTTGACCTCGCTGCGTCATCAGATCGATTGGATAAATGTGGTACGTTACCGGCCAGATCACATTCAACAGTAAGAAAAAATGCGCCTTCTTTTAAAAATGATTACACAGAGATATCATCCTTGGATCGACGTCTTACAAAGTCTCAAGATAATTTGGATGAGACACCTCGCTCATATAAACGGAACTTAGACAGACCTGAGGTACTTGGGGCTTTGGAAAACTCACATATCGAAAAGAAAGGTTATAGGTCAAAGCCATCAGATATGTTCGGTAAACCCTTATTCTATGGGAGTACGACTACTTTGCCTACAAATATATCACCTAATAAATCTATTTATCATGCACGTAGTATAAGTGATGATAACAGCAACAAAAACTCTGCTTCACATAAATCTAATCTCGAGTATTTGATAGAGCAAAGAAAGAAGTCAATGTCAAAGTTGCGTGGTTTAGTAATACCTGAAGTTCAAGCCCCTATTGTAGACTTGCCTGAAATTAAGGTCAAGGATCCACCACCAAAAACTTTACTTCATAATTTCTCACAGATCCAAACAACAAAAGAAAGTAAAAGCGGTTCTAAAAATCCCTTATTTGTGTCAGAAAATAAGTGGAATACCAGCTTTTTAACAAATAATATACCAAAATATTCACCAGCATTCAAACGAAAATTTCTACAAGTTTATACGCCTTCTCTATCGAAGGAAACAACTCCAGAGAGAAAATTGCCTTCAAAGGTAGACACAAATGAAAGAATTATAAACAATTATAACAATTCTTTTTCGTCTAAACCTGCTAGGATTACACAAGAATACCCGAAACCCACGAACAAGTCCACGCAATTGTTAAATAAAGAATTCTATAAAATTAACTCCAATGATAAAAATAATGATTTTAAAAATTATGAAATACGAATTTGCACTACTACAGATGTTGTGGATAGTGATAACGATTCTGCCATGAGTTCTACTCAATCAAGCTATCGTTCTTCAGCTTCTTCACCTATGCATAATATGGACCACTTAGAGTCAGATAGCTCACGATTGTCTCCCAAACTTTCACACTACAATTCGTATTCCATAGTAAAGACAGAAATCCCGAATAAGTTACTACCTATCTCTCAAAAGAACTATGAGGAATATAGCAAAAGAAAAGTTTGCCGATCTATGTCTTCGGACACAAATATTTCCCTTAGTTCTTCTGCAGGTTCTGCTGCTACATCAGGATCACAAGCAAGTTGTAGCTCTCTTGAAAGTTCTGTGGCTGACTCAGATAAGAGAAAAGTTTCAACAATATATAATGTGGATACAATAAATCGAAGAAACATTGTAGCTTCTTCAAAGTGTAGAAGCGGAAGAGACGTAACGCTTACGTCCCCAGTTATTGAAACTAAGTTCTCGCATGAGTCCTACAACAGATCTCCTAGTCCAACACACAAACCGTCTGATCGCCTTTCGACGCTCACATCAACAGTGAGCGCTGTAGCGAATAAAGGTGAAAACAGACGACGTAATAAAAAAATTGTTTCTGATTCTGATTCTGATAATGAAGAAAATATCAGACAGCGCAAACCTGACTATAAGAATACAAGATTAAAACGTAATTCTAGTACAACGAATAAAAATAATAAATATAGTGACAATGTCCAGGTAACAGAAGTAATAGAAAAGGAATCTGTTTTAAAAGAAAAAGTTTCAAAAGGAAAAGATTGTAATAACGAAATTCAGGATAATCAGAAAGATTTAGACAAAAATAAATCTGAAGTAATCAACACTACGGGTCTTACGAAAATTGAAAAGGACAAAGAAAAAAGAAATGGAATTACACCCAAAAAAGACGTTCCTGTAAAGTTGCAAAATGTTAAACCTGTATCTATCCCTGTGATAAATTCTAATAATGTGCAAG
AGAAACCAGTGAAGGTGACTACACAAGTAATACGTCTTATAAAGGGAGCTGGTTCTGGGGTTGGTCTTATATTAGCCGGTGGAATTGATTGTGAGGCTAAGGATGTAACAGTACATCGTGTGCTAGAGGATAGTATAGCAGCAAAAGCTGGGATAAAGAGGGGATCAAAAATTCAAAGTATAAACGGCAATGCTATGAGCGGAATGACCCACGCTCAGTCCGTAAAGGTGTTAAAGGAACAGCGTTCAGAAGTCATCATAGAAATAGAACTTCCGGATAACAGAACTCTTAAAGACTGCGGCTCTCAACATTCGGAGTCGAAAGGGCAACAGGGAATGGACGGAACAAAATTCCGCAACAATTCAGGCCGTTCAATTGTGACTGTTATATTAGAAAAAGCCGGGGGTGGTGCTGGATTGGGTTTTGGCCTAGATGGAGGAAGAGATTCTCCTCAAGGAGACAAACCTTTGACTATAAAAAAACTGTTCGCCGGAGGTGCGGCTGCTCAAAGTGGGAAAGTTTTGGTTGGCGCAGAACTGCTCTCTGCTGGTGGTCAAGCTATGGAGGGATTTACTCGTACTCAAGCGTGGGCTGCTCTTAAAGCCTTACCAGCCGGTCAAGTGACTTTAGTGTTACGGAACCCGTAA

Protein sequence:

>DPOGS203692-PA
MRHMCTIQTSPEAVIRTAGPSLVCGMEADAAVEEFVTIVPVGENSTVVQDPEFVTVLNVSVTKGGAEVIVNRPRGARLGLGLKFEGGSAATEKVRRLLVQSCAEDSPAANACTPWGKLISGDEILAIDGTPVSELTRIECVRRLKDSDESLILLVRHFETLDKVDQQNMETKETVMTDSKTFVDISRPLTLPPPVPPRKLGKKNSFKDKTTLQTVGEQNALMNLNLSDTGNDKDSIISALVHQHSMVSTMNKLSRKSSFDSHIHRQTKESFAHLKKGSPEEVRRLVRRLSDGKTIPPEAEVYIDLLSNEWERCLVLADAESDDTGSSISTVVDRLGSMASSVENSIPSTPVMQQKSIDIQKVLNSIENIDSDILKDMTNVKYSETKVDRNDDLKRCLRNSMTNGQKVSNTNSNIVTENIKPKQCENDNNNNETSKTLAPNDTFAINDKKMKEDVVLDKPKPMPRSTKVERSASEKKRRPVPIPEVQPPNISEPLLTPTKKTCIESWLQRSEAEMQNTSTEKIEETTIPEALPRLIDFVPKNQYKEKSNESPTCVRPTTAPPPPPVPPPETREDGSGASLDTIGELDETCDVVDNAKQTKDIETNKRKSHYERNFWDDRVNRSDAEDKSLSNDETPLPSCPRQPPDGVETPSDVVEKVPDLPTRLPPTTWISTRSRYSYASRTQDARSEASVKDKIAMFSVDLAASSDRLDKCGTLPARSHSTVRKNAPSFKNDYTEISSLDRRLTKSQDNLDETPRSYKRNLDRPEVLGALENSHIEKKGYRSKPSDMFGKPLFYGSTTTLPTNISPNKSIYHARSISDDNSNKNSASHKSNLEYLIEQRKKSMSKLRGLVIPEVQAPIVDLPEIKVKDPPPKTLLHNFSQIQTTKESKSGSKNPLFVSENKWNTSFLTNNIPKYSPAFKRKFLQVYTPSLSKETTPERKLPSKVDTNERIINNYNNSFSSKPARITQEYPKPTNKSTQLLNKEFYKINSNDKNNDFKNYEIRICTTTDVVDSDNDSAMSSTQSSYRSSASSPMHNMDHLESDSSRLSPKLSHYNSYSIVKTEIPNKLLPISQKNYEEYSKRKVCRSMSSDTNISLSSSAGSAATSGSQASCSSLESSVADSDKRKVSTIYNVDTINRRNIVASSKCRSGRDVTLTSPVIETKFSHESYNRSPSPTHKPSDRLSTLTSTVSAVANKGENRRRNKKIVSDSDSDNEENIRQRKPDYKNTRLKRNSSTTNKNNKYSDNVQVTEVIEKESVLKEKVSKGKDCNNEIQDNQKDLDKNKSEVINTTGLTKIEKDKEKRNGITPKKDVPVKLQNVKPVSIPVINSNNVQEKPVKVTTQVIRLIKGAGSGVGLILAGGIDCEAKDVTVHRVLEDSIAAKAGIKRGSKIQSINGNAMSGMTHAQSVKVLKEQRSEVIIEIELPDNRTLKDCGSQHSESKGQQGMDGTKFRNNSGRSIVTVILEKAGGGAGLGFGLDGGRDSPQGDKPLTIKKLFAGGAAAQSGKVLVGAELLSAGGQAMEGFTRTQAWAALKALPAGQVTLVLRNP-
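
The transcript and protein lengths listed in this record are consistent with each other: 4647 bp corresponds to 1548 codons plus one stop codon (3 x 1549 = 4647), and the trailing "-" in the protein sequence appears to mark that stop. The sketch below shows one way to check this after copying the two FASTA records out of the page. The function names `read_fasta` and `cds_matches_protein` are hypothetical helpers written for this illustration, not part of any database tool, and the check assumes the transcript is coding sequence only (no UTRs), which is an assumption rather than something the record states.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}.

    Sequence lines that are wrapped over several lines are joined,
    and blank lines are ignored.
    """
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def cds_matches_protein(nt_seq, aa_seq):
    """Return True if the nucleotide length equals 3 * (residues + 1 stop).

    A trailing '-' or '*' on the protein is treated as a stop marker
    and is not counted as a residue (assumption for this sketch).
    """
    residues = aa_seq.rstrip("-*")
    return len(nt_seq) == 3 * (len(residues) + 1)


# Toy example using the first three codons of the transcript plus a stop codon.
example = ">tx\nATGCGT\nCATTAA\n\n>pep\nMRH-\n"
records = read_fasta(example)
print(cds_matches_protein(records["tx"], records["pep"]))
```

For the full record, paste both sequences into one string and pass `records["DPOGS203692-TA"]` and `records["DPOGS203692-PA"]` to `cds_matches_protein`.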