Monarch geneset OGS2.0

DPOGS205330
Transcript: DPOGS205330-TA, 6294 bp
Protein: DPOGS205330-PA, 2097 aa
Genomic position: DPSCF300322 + 471637-488021
RNAseq coverage: 582x (Rank: top 22%)
Annotation
Heliconius: HMEL015999, 0.0, 69.24%
Bombyx: BGIBMGA007388-TA, 0.0, 78.12%
Drosophila: CG17839-PB, 3e-173, 44.01%
EBI UniRef50: UniRef50_E0VKS0, 0.0, 51.71%, Predicted protein n=6 Tax=Pancrustacea RepID=E0VKS0_PEDHC
NCBI RefSeq: XP_973440.2, 0.0, 55.38%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp: gi|189239844, 0.0, 55.38%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx: gi|189239844, 0.0, 55.90%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, 2.2e-14, protein binding
KEGG pathway: rno:84015, 6e-13
 K12567 (TTN) maps-> Dilated cardiomyopathy; Hypertrophic cardiomyopathy (HCM)
InterPro domain:
[1873-1965] IPR008957, 5.4e-20, Fibronectin type III domain
[294-405] IPR013783, 1.1e-15, Immunoglobulin-like fold
[540-621] IPR003961, 2.2e-14, Fibronectin, type III
[197-289] IPR002602, 3.7e-14, Domain of unknown function DB
[1696-1762] IPR013098, 2.6e-08, Immunoglobulin I-set
[1691-1769] IPR003599, 5.9e-08, Immunoglobulin subtype
[1697-1758] IPR003598, 1.3e-07, Immunoglobulin subtype 2
Orthology group: MCL17725, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205330-TA
ATGTTGTGGCCCCACTTTAGTTCATTGATATTCGCGATGAGACCACAGAGGGTCCAGATGACCTTTTTATCAGGACCGGGGGTCAAAATTGATACCAAAAGCGATGTCAAAGTTTTGACCAATAATCAAGATGACAAGTCAAAGGAGGCGTGGATGAGGGCGCCGGCTGGTGCTGTCACTGTGAAGAGTGGCGAGGATGCTCTGCTGACTTGCGTGGTGCTAGGAGCTAAGGAAAGACCGGTGATATGGAAACGGGCGAGGGATCATCAAGTGTTGACAGCTGGCAACCTGAGAGTGACTCGAGATGATAGGGTGCAAGTGTTGCACGATGATTCTGAAGAGCCCCTCCAAGGTCCAGGCATCACAAAGGGCGGGGACGTGTGGGCGCTCGTCATAAAAACCGTCAAAGCCTCAGATGCCGGAATGTACGTGTGTGAACTAAACACAGAACCGCCTGTCAGAAGTTTCCACAAATTGACTGTAATTTCGAGAGGGCTAACTCCACCAGAGAATCAAAATACCACTGATAGCTACTCCGCTAACGTACCTACACTATCATCAATAGCGTTCTCGCATAACTATACTGATTGTTGTTTATCTGCGAACGTGACAAAAAGCTGTTTGGGCTACTGCAGTATACAGTCTATTCTGGATGGGCCTGGACAAGATCCAGAAGCGTGTCAAACTGATTTCCCGTCAATTGTAAAATGCATGGCGGATGGCCGTAATCATGTACCATGCTGTGTCCAAGAGCATATTCCTGATATATGCCAAGACGTGTGCAGAGGAGAGTACTCTCCGGTAACAGATAACATTAAAACTCACTACTCCTGTGCTGCTTCGATGGACAAAACTTTAGCTTGCATCGTTGAGGGCATAGAACTCCTGCCAAGTCAACCCGAAGACGTAGAAGTGGAGCCTCTAAATGAAAAACAACTAAATGTCTCATGGAACCCACCAACAGAAAACACAGAATCTGTCACTGAATATATCGTAAATGTGACAACCTTGCGATCCTTTGATGCCCATTTAATAGATCCCTCCGAGAGTTCTCTAAAGAATATAAGCGACAAAACTAATACAAAAAATTTCAAAGTACCATCAAACAAAACTTTTCTAGTTATAAATGACCTACTACCCTTTACGATGTATGAGATTACAGTGACGTCATATAATATCCACGGTTCGAGTTTACCTAGCTATGGGATTAGATCGTTAACCCTCACACCCGGTAAGATGAGGAACACAGAAGTAGCGGCAGCCCCAAAACTGCCCGATGTTAAAGGTTGTTGCGTTAAAGCCGGTATCAATCATTATGGATGCGTTGACAAACTTTGTGATCCAACCAAAACTTATGAGATTGCGTTCGTCGGTTCTAACCACCAGTTGCAGGTCACAGATCTCATGATCTGCGCTCCTTGGGCCGGCATAACTTTCGGATGTCTCGCCAACGGCATGGACCATTCGCCTTGTTGTCGAGCTAGAGGACTACCAGACACTTGCCTTCCATTGTGCAGTGGAAATATCACAACGTTGGATTTCAATCACTTTAAGTGTCTTCGTTACATGACTGCGTACACAAATTGTCTTTTACAAGGCTACGGAGTACTACCAGGTCCGCCATCCCGCCTTCACCTGACTAACATAGACACAGACTACGCTGTAGTGCATTGGGCACCACCTGCAGTACTAGCCGACACCGTCCAGTATTACCACCTGCATTTCAAATCTCTACAGTCTGAAGAAGATTATAGGACATTGGATAAAGTACAGTCTCCGTACATTCTTGAAAATTTAGAATCAAACACTGATTACGAGATCTATGTAGAAGCAGTGAATGAGCACGGTGTTGGAGAATCATCGCCTCGACTTATATTCAGGACTCAGAGTCAGGTGATAGAGGAAGAGGAAGAGAACGCTCTTGCTTACAACGTGACTGCATGTTGTTCTAACGTGGGCGTCTCATCCGTGTGCATGCCGCTGTGTGCGTACGACGCACG
CATGTCCGATCTGCGTTCTCTGGCGCCACTCTATATCAAAAGCGATGTTAAAGTTTTGACCAATAATCAAGATGACAAGTCAAAGGAGGCGTGGATGAGGGCGCCGGCTGGTGCTGTCACTGTGAAGAGTGGCGAGGATGCTCTGCTGACTTGCGTGGTGCTAGGAGCTAAGGAAAGACCGGTGATATGGAAACGGGCGAGGGATCATCAAGTATTGACAGCTGGCAACCTGAGAGTGACTCGAGACGATAGGGTGCAAGTGTTGCACGATGATTCTGAAGAGCCCCTCCAAGGTCCAGGCATCACAAAGGGCGGGGACGTGTGGGCGCTCGTCATAAAAACCGTCAAAGCCTCAGATGCCGGAATGTACGTGTGTGAACTAAACACAGAACCGCCTGTCAGAAGTTTCCACAAATTGACTGTAATTTCGAGAGGGCTAACTCCACCAGAGAATCAAAATACCACTGATAGCTACTCCGCTAACGTACCTACACTATCATCAATAGCGTTCTCGCATAACTATACTGATTGTTGTTTATCTGCGAACGTGACAAAAAGCTGTTTGGGCTACTGCAGTATACAGTCTATTCTGGATGGGCCTGGACAAGATCCAGAAGCGTGTCAAACTGATTTCCCGTCAATTGTAAAATGCATGGCGGATGGCCGTAATCATGTACCATGCTGTGTCCAAGAGCATATTCCTGATATATGCCAAGACGTGTGCAGAGGAGAGTACTCTCCGGTAACAGATAACATTAAAACTCACTACTCCTGTGCTGCTTCGATGGACAAAACTTTAGCTTGCATCGTTGAGGGCATAGAACTCCTGCCAAGTCAACCCGAAGACGTAGAAGTGGAGCCTCTAAATGAAAAACAACTAAATGTCTCATGGAACCCACCAACAGAAAACACAGAATCTGTCACTGAATATATCGTAAATGTGACAACCTTGCGATCCTTTGATGCCCATTTAATAGATCCCTCCGAGAGTTCTCTAAAGAATATAAGCGACAAAACTAATACAAAAAATTTCAAAGTACCATCAAACAAAACTTTTCTAGTTATAAATGACCTACTACCCTTTACGATGTATGAGATTACAGTGACGTCATATAATATCCACGGTTCGAGTTTACCTAGCTATGGGATTAGATCGTTAACCCTCACACCCGGTAAGATGAGGAACACAGAAGTAGCGGCAGCCCCAAAACTGCCCGATGTTAAAGGTTGTTGCGTTAAAGCCGGTATCAATCATTATGGATGCGTTGACAAACTTTGTGATCCAACCAAAACTTATGAGATTGCGTTCGTCGGTTCTAACCACCAGTTGCAGGTCACAGATCTCATGATCTGCGCTCCTTGGGCCGGCATAACTTTCGGATGTCTCGCCAACGGCATGGACCATTCGCCTTGTTGTCGAGCTAGAGGACTACCAGACACTTGCCTTCCATTGTGCAGTGGAAATATCACAACGTTGGATTTCAATCATTTTAAGTGTCTTCGTTACATGACTGCGTACACAAATTGTCTTTTACAAGGCTACGGAGTACTACCAGGTCCGCCATCCCGCCTTCACCTGACTAACATAGACACAGACTACGCTGTAGTGCATTGGGCACCACCTGCAGTACTAGCCGACACCGTCCAGTATTACCACCTGCATTTCAAATCTCTACAGTCTGAAGAAGATTATAGGACATTGGATAAAGTACAGTCTCCGTACATTCTTGAAAATTTAGAATCAAACACTGATTACGAGATCTATGTAGAAGCAGTGAATGAGCACGGGGTTGGAGAATCATCACCTCGACTTATATTCAGGACTCAGAGTCAGGTGATAGAGGAAGAGGAAGAGAACGCTCTTGCTTACAACGTGACTGCATGTTGTTCTAACGTGGGCGTCTCATCCGTGTGCATGCCGCTGTGTGCGTACGACGCACGCATGTCCGATCTGCGTTCTCTGGCGCCACTCTGTGATCGGGACTTCCCCAAAATTGTTC
GATGTGGGGCTGGTGGTCGAAATCACGAGAAGTGCTGTGCTAGACGTGGGGTGCCGCGTGCCTGCAGCTCTGTGTGCACAGGAGCAGCACAAGTATCGTTATTTTACTCCTGTGTCGCATACATCGGAAACGTTGTGCAGTGCTTTGAGGAAGGTACGGGCACATTACCTGGTCCACCACGTGAAGTACACGCAGTGATATCAAGAGATAAACTATTTTTGGACTGGTCACCACCAGCTGATGGCGCTAACGCTACTTCATACGTCGTGCATTGGCAGAAAGTCGGGAACAACACCCAGCCTTACTATTACAACACTTTACAGTTGGACAACAAAATAAATGCAACTGAGACTATGGCACGGATGGAAAATTTGGAGGTAAACAGCTCATACCACATGTTTGTGGTGGCGGTCAACGAACATGGAACCTCTCTACCATCCAGCATGCTGCTCATCAACATAACTAATGGTGACGCTGAGAACAAAGAGGTATCCGGTATTCCATCTCCTCCTCACTCACTCTCTGTTGCGTCTCACTCAGCCACATGGCTGGCTCTGAGCTGGCAACCTCCACAATTCTCCCTACCCGATGAAAAAATATCTTATACATTGTACGTTAAAACGCCGTCAAGCCAATCCAACACGACTAAGATCACTACATCAGTGACTGGCCATACGCTGGAGAAGTTGTCTCCAAACACCCAGTATGTAGCCTGGGTAGTTGCGACTGCCGGAGCTGGTAGTTCACCGCCTTCCGAAACTTTACTAGCATGGACTGATCCAGCTTACCCCGCTTTCGTTGAGCCCCCGACAATCAACCCGGTTAACCTTGTAGTCGAAGGATCTAGCATGACTATTCTTTGTATTGCCATGGGAACCCCAACCCCCACCATATCGTTATACATTTCTGGTCACCTCGTCCGTCAAGAAGTAACCCGTCACATGGTAACAGTCGTCCACAACGTCACGAGAGATATGGAACTGATATCATGTTACGCGGATAACGGATACGGCACCCCTATGCAGGCGTCTCGTAAAATCAATATATCACACATACCTACCATCCAAGCGTCAGGTATTACCATGGCTGCGGCGGGTGACTCTGTTATACTGGAATGTCGTGTGGAAGCGCTCCCCAAACCCACAATCGCTTTCTGGAGGGACCCTAATGGAAGAACACCAGAAAAAACAAAATACACGATGCGATTAATAATAAAGAAGATAACGGAAACTGATGAAGGAGATTACTTTTGTCACGCGGAAAATGCATTTGGCAAAACATTACGCCCTGTGTCGGTACGACTGCGAGCAACCGGCCCACATCACAATGTCACTGAATGCTGCACACAGATGAACGTGTCGTCGTCATGTATCGACGCGTGCAGCTTTCACTTGGACATGGACAACATCATGGACCGTCCTGAGTGCATGAACGATTTTGATAAACTCATGAAATGTGCTGCTGATGGATCAGATCACCGTAGTTGTTGTGCTTCCTGGGGCGTTCCTCGGAACTGCCTGGAACTTTGTCGCGGCGGTTCCGTTTCCAAGACTTGCGCTCTCCAACATGCAAGACGAGCATTGGCCTGCTTTAGAGACTCTGGTGCTAAACTACCAGGTCCACCAAGAAACTTAAAAGCACATGCTGCCCCTACACCAAATACTGTGTTGCTAAGCTGGGAGCCGCCCCTAAAGAATCCGCAAACCGTGTATCTTTATCGTGTTTTCTGGCGAGCGTATGGTGCCAAAGTCCCTGAGAAGTTGGACACAAGTGAAACCAGCGTGGTACTGACCGGTCTACAGGATGATGTGCGTTACGAATGCGTGGTGAAAGCTGCCAATGATGCGGGTACATCATCTCTCAGCCAGTCAATCATGTTTACGACTGCTGGTCAGGAGACGGGGGCATCAGCTTCTAAAGTGTCGACTTCTGGAACAGCCTCCGCAGTGGGAGTAGCGGTGGCTTGTATTCTAGTAGTCGCCATATTATTAGCTGCTGGT
CTTTATTACAGACATAGAAAGAATCTTAGACTAAAGGCCCAAGGAGGAGTTGCCTTCGAAAACCCAAGCTACCTCAGAGAACCAAACCCTGATAGCGTTGTCAATGGTAACGTACCTAACGGGATTACCAACGACAATATGAACGTCATTGGCAATGGTATATCTACGAGCACAGCATGGAGACAAGAGACCCAGACGCCTGGCAGCACCATCCCACAGCGGGAGGTGGATCCTTCCTTATATGAAGAACTAAAACTCGGCCACGACGGGGCGGGCTTCAAGAGACTGAAGTAG

Protein sequence:

>DPOGS205330-PA
MLWPHFSSLIFAMRPQRVQMTFLSGPGVKIDTKSDVKVLTNNQDDKSKEAWMRAPAGAVTVKSGEDALLTCVVLGAKERPVIWKRARDHQVLTAGNLRVTRDDRVQVLHDDSEEPLQGPGITKGGDVWALVIKTVKASDAGMYVCELNTEPPVRSFHKLTVISRGLTPPENQNTTDSYSANVPTLSSIAFSHNYTDCCLSANVTKSCLGYCSIQSILDGPGQDPEACQTDFPSIVKCMADGRNHVPCCVQEHIPDICQDVCRGEYSPVTDNIKTHYSCAASMDKTLACIVEGIELLPSQPEDVEVEPLNEKQLNVSWNPPTENTESVTEYIVNVTTLRSFDAHLIDPSESSLKNISDKTNTKNFKVPSNKTFLVINDLLPFTMYEITVTSYNIHGSSLPSYGIRSLTLTPGKMRNTEVAAAPKLPDVKGCCVKAGINHYGCVDKLCDPTKTYEIAFVGSNHQLQVTDLMICAPWAGITFGCLANGMDHSPCCRARGLPDTCLPLCSGNITTLDFNHFKCLRYMTAYTNCLLQGYGVLPGPPSRLHLTNIDTDYAVVHWAPPAVLADTVQYYHLHFKSLQSEEDYRTLDKVQSPYILENLESNTDYEIYVEAVNEHGVGESSPRLIFRTQSQVIEEEEENALAYNVTACCSNVGVSSVCMPLCAYDARMSDLRSLAPLYIKSDVKVLTNNQDDKSKEAWMRAPAGAVTVKSGEDALLTCVVLGAKERPVIWKRARDHQVLTAGNLRVTRDDRVQVLHDDSEEPLQGPGITKGGDVWALVIKTVKASDAGMYVCELNTEPPVRSFHKLTVISRGLTPPENQNTTDSYSANVPTLSSIAFSHNYTDCCLSANVTKSCLGYCSIQSILDGPGQDPEACQTDFPSIVKCMADGRNHVPCCVQEHIPDICQDVCRGEYSPVTDNIKTHYSCAASMDKTLACIVEGIELLPSQPEDVEVEPLNEKQLNVSWNPPTENTESVTEYIVNVTTLRSFDAHLIDPSESSLKNISDKTNTKNFKVPSNKTFLVINDLLPFTMYEITVTSYNIHGSSLPSYGIRSLTLTPGKMRNTEVAAAPKLPDVKGCCVKAGINHYGCVDKLCDPTKTYEIAFVGSNHQLQVTDLMICAPWAGITFGCLANGMDHSPCCRARGLPDTCLPLCSGNITTLDFNHFKCLRYMTAYTNCLLQGYGVLPGPPSRLHLTNIDTDYAVVHWAPPAVLADTVQYYHLHFKSLQSEEDYRTLDKVQSPYILENLESNTDYEIYVEAVNEHGVGESSPRLIFRTQSQVIEEEEENALAYNVTACCSNVGVSSVCMPLCAYDARMSDLRSLAPLCDRDFPKIVRCGAGGRNHEKCCARRGVPRACSSVCTGAAQVSLFYSCVAYIGNVVQCFEEGTGTLPGPPREVHAVISRDKLFLDWSPPADGANATSYVVHWQKVGNNTQPYYYNTLQLDNKINATETMARMENLEVNSSYHMFVVAVNEHGTSLPSSMLLINITNGDAENKEVSGIPSPPHSLSVASHSATWLALSWQPPQFSLPDEKISYTLYVKTPSSQSNTTKITTSVTGHTLEKLSPNTQYVAWVVATAGAGSSPPSETLLAWTDPAYPAFVEPPTINPVNLVVEGSSMTILCIAMGTPTPTISLYISGHLVRQEVTRHMVTVVHNVTRDMELISCYADNGYGTPMQASRKINISHIPTIQASGITMAAAGDSVILECRVEALPKPTIAFWRDPNGRTPEKTKYTMRLIIKKITETDEGDYFCHAENAFGKTLRPVSVRLRATGPHHNVTECCTQMNVSSSCIDACSFHLDMDNIMDRPECMNDFDKLMKCAADGSDHRSCCASWGVPRNCLELCRGGSVSKTCALQHARRALACFRDSGAKLPGPPRNLKAHAAPTPNTVLLSWEPPLKNPQTVYLYRVFWRAYGAKVPEKLDTSETSVVLTGLQDDVRYECVVKAANDAGTSSLSQSIMFTTAGQETGASASKVSTSGTASAVGVAVACILVVAILLAAG
LYYRHRKNLRLKAQGGVAFENPSYLREPNPDSVVNGNVPNGITNDNMNVIGNGISTSTAWRQETQTPGSTIPQREVDPSLYEELKLGHDGAGFKRLK-