Monarch geneset OGS2.0

DPOGS207502
Transcript        DPOGS207502-TA  5772 bp
Protein           DPOGS207502-PA  1923 aa
Genomic position  DPSCF300177 - 450544-481478
RNAseq coverage   23x (Rank: top 78%)
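The lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that the transcript consists of coding sequence ending in a single stop codon, and that the genomic coordinates are 1-based and inclusive. All variable names are hypothetical.

```python
# Values copied from the record above.
transcript_bp = 5772
protein_aa = 1923
genomic_start, genomic_end = 450544, 481478

# Split the transcript length into whole codons plus any leftover bases.
codons, remainder = divmod(transcript_bp, 3)

# Assumption: the transcript is coding sequence only and ends in one stop codon,
# so the codon count should equal the protein length plus one.
cds_consistent = remainder == 0 and codons == protein_aa + 1

# Assumption: genomic coordinates are 1-based and inclusive.
genomic_span = genomic_end - genomic_start + 1

# Bases inside the genomic span that are not part of the transcript
# (for example introns, under the assumptions above).
non_transcript_bp = genomic_span - transcript_bp

print(codons, cds_consistent, genomic_span, non_transcript_bp)
```

Under these assumptions the transcript and protein lengths agree, and most of the genomic span lies outside the transcript.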
Annotation
Heliconius       HMEL009433        0.0  89.48%
Bombyx           BGIBMGA001924-TA  0.0  89.67%
Drosophila       CG42330-PD        0.0  52.66%
EBI UniRef50     UniRef50_Q0E8G9   0.0  52.66%  CG42330, isoform D n=34 Tax=Endopterygota RepID=Q0E8G9_DROME
NCBI RefSeq      XP_972891.2       0.0  57.22%  PREDICTED: similar to AGAP007092-PA [Tribolium castaneum]
NCBI nr blastp   gi|189238865      0.0  57.22%  PREDICTED: similar to AGAP007092-PA [Tribolium castaneum]
NCBI nr blastx   gi|189238865      0.0  57.48%  PREDICTED: similar to AGAP007092-PA [Tribolium castaneum]
Group
Gene Ontology    GO:0005515  9.2e-16  protein binding
KEGG pathway
InterPro domain  [886-1004]  IPR008957  6.1e-30  Fibronectin type III domain
                 [900-996]   IPR013783  4.4e-26  Immunoglobulin-like fold
                 [904-988]   IPR003961  9.2e-16  Fibronectin, type III
                 [422-508]   IPR013098  3.2e-14  Immunoglobulin I-set
                 [426-498]   IPR003598  5.1e-14  Immunoglobulin subtype 2
                 [197-284]   IPR003599  7.8e-10  Immunoglobulin subtype
Orthology group  MCL10022  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207502-TA
ATGTTCTCAACTAACTTTGTGTTCATGACTCCGACACAGGACTATCAGTTGGACTGGCTGTTCTCAGACAACAGCCCAGTGACGTCACTAGCTGGTCTGAGAAGAGTGCTGGCCAACGGATCCCTGGAACTACCACCGTTCCCAGCTGATAGATACAACAGGGACGTCCACGCTGCCACTTACCGATGTCGACTAAGACTGAGCAGTGGTTACGTACTGTCCAAGAATATACATGTACACGCTGTATTAAACATACCATGGGAGGTGCGAGTGCCAGATGAGTACGTGATGGCAGGAAACTCGGCCATATTGAGATGTACAGTGACTCCGCGCTGTGCCGACAGAATCGAATATACAGACTGGCTCACAGACGACGATACCAGAATCCTAGATTACTTAGGTTCGAAATACTCTCGTTTGGACGACGGTTCGTTGTACATAAGTGAAGTGCGTCGTGGCGAACGGTACTCCGCCTTCCGCTGCAGAGTACGCGACCGACTCACGGGTTCCGTATATTCTAGTCAATATTTCGGTCATATTATAGTGACGGAACCAAAGGGGGGCGTTCGACCGAGAATTTCGGTGGAGCCCCGCTTACGAAGAGTGCCCGTGGGTCAAGATATACGGTTACCTTGCGCTGCTCACGCCTGGCCTGTGCCCAGCTATAGGTGGTTTCGTGAGCACCACGAGGAACTGACTCCCATAGAGCAGACTTTACTATGGCCGCGGGTGACAGTGATGAAGGGTCTGCTGATTATCCAGAGAGTTCAACGTGAAGATAGTGGCAGGTTCATATGCTGGCTGAACAACACCGTTGGTGTTGAATCCGCTCATTTCACACTCATTGTTACAGAACCAATTTCCGTCCGAATAAAACCAGAATTATTAAAAACGAAACGAAACGTGGATGTTAATTTCGATTGCAAGGTCGGTGGTCATCCAATTGAAATAGTTTATTGGACCCATGATGGAAAAGTTGTTAAGAATTCAGAAAGGGTTAGGGTTTCAGAAGATGGACTGAGATTACATATCAGAAACACTCAGCAGAATGACCAGGGGATGTATCAGTGTTTTGCTAGTAATACCAGGGATCAAGCATACGGAATCACGGAATACGTCATCGATGCAGCTGACAGCCGTTCACGATCTGTCGGTAATCGCTCCCGTCGGGCTCCTTACGCACGCAGCACGCATACTCACGACACAGATTGGAACTTCACTGGAACTAGAGAATCGAAGCCAGAGTTGGTATACTGGTTCTCAGAGCAAACCTTGCAGCCTGGACCAAGCGTGTCTCTTAAATGTGTAGCCATGGGACATCCACCTCCACAATTCACATGGTTACTAGATGGGTTTCCTATACCTACTAATTCAAGATTCGTCGTTGGCCAGCACGTGACACTTCAAGACGACGTTGTCAGCCATCTTAACATAAGTCGAGTAACAGAACAAGATGGCGGCGAATACGCATGTGTTGCCAGTAATTCAGCAGGAAAAGCGATCCATGCGGCAAGGGTGAACGTGTACGGGTTGCCATTTATTCGGGAGATGCCCAAAGTTACGGCTGTTGCAGGCAGTGATTTGAATATTAAATGTCCTGTCGCTGGATATCCTATTGAATCAATTACATGGGAAAGAGAGGGGCAGACATTACCTTTAAATCGCCGCCAAAAAGTGTTTCCGAACGGTACGTTAATAGTGGAACAAACACAACGAGGAGAAGACGCCGGCACTTACACCTGCCAAGCAACAAATAGACAACGTCATGTTGCACGAAGAGATGTCGAAGTACAGATATTAGTCCAACCAAAGATATTGCCCATTCAGCCACTGACGAATTTATTAAGGGAGGGCATGAGAGCGGCTATTTCCTGTCAGATATTGGAGGGCGATTTGCCGATTGCATTCCGATGGGAGAAAAACGATCAGCCTGTGACGTCATCGCCATACGCTCCAAGCGGTATAATAACGAGGCGAATGGATGAGTACAGTGCGTCGCT
CGTGATAGAACACATAACTTCGCTACATAGTGGAAATTACACTTGCATCGCATCAAACGTGGCGGGTTCTGAGCGGTTTACCGTACCGCTTACAGTTAATGTACCACCAAGATGGCGGCTAGAGCCAAGCGATGTGGCTGTGGCTGCGGGTCAGGACGTCAATCTTCAGTGTCAAGCTGATGGCTATCCAAAACCCAATATAACATGGAAGAAAGCTGTTGGTAACACTCCTGGTGAATACAAGGACTTCATGTTTGAGGGAACTTCACGGGTTCTTGAAAACGGCTCCCTGATATTTGAACGAGTTGCGAAAGATAACGAGGGTCACTATTTGTGTGAAGCAAGAAATGATATAGGAGCCGGGCTGAGCAAACTTATCTTCTTGAAAGTTAACGCCCCAGCCAGATTCCCAATGAAGAGCAAAACTGTACAAGTAACAGCAGGTGAAGCAGCTCATCTTCAGTGCGCGGCGTCAGGAGATGCTCCACTTGAAGTCAGTTGGAGAAGTCCTCATCATCACACCATCGCACATCATCTTGACCAAAGATATACAATACGCGAGCAAGTCCTGGACGATGGTCTAGTTTCGGAGTTGAGTATTTTACAGACCTACAGACAAGACACAGGTGCACTGACTTGTCGAGCATCCAATGCTTATGGACAAGACGAAATGCTTATACATCTTGTTGTCCAAGAGGTTCCTGAGATGCCAAAAAACATAAGAGTTATTGACCAGCAGTCACGATCAATACAAATATCTTGGACTCAGCCATACGCTGGGAACAGCCCCATTATAAACTACATTGTACAATACAAGGAAGCTCCTGAACCATGGCCAACGACTCCTCAAAAGGTTATTGTCCCTGGCAGCGTGACGTCATCCAGCGTCCAGAACTTACAACCAGCGACGTCTTACCACTTGCGGATCATTGCTGAAAACCGCCTGGGCCAAAGCGAGCCCAGTCAACTGGTACAGGTCACTACTACTGAAGAAGTACCTAGCGGCTCACCCATAGACGTGCGAGTCGAAGCAAAAAGTTCAACTGAATTAACAGTGAGTTGGGATCCGCCCCAGCGAGATCTTTGGAACGGAAATATTCTAGGCTATTACGTGGGATTTCAAGAATTAAACAGCAACAGTACGGTACTAAGTGCGAGCGGTCCTGGCGGTGCTTCATACACCGTGCGTACGGTTGAAGGAACGGGCTCTGCACGTGCAAGAACCACGCTCGCGGGTCTGCAGAAACATGCTGCATATGCTGTAGTCGTACAAGCTTACAACAGCAGGGGTGCTGGACCAGCTAGCCCACCTACTACTGCTACCACCTTGGAAGATGTGCCTAGTCTTCCACCTGGAACACTTAATTGTAGCGCGCTGTCATCTCAATCGGTTCGTGTCACATGGGAGCCACCACCTATGCGAGGTAGAAACGGAGTTCTGCAAGGATATAGAGTAACGTATGCACCTGTTACTGATTGGTATGGAAGTGAAGAAGCTGTGACAAAACAAATATCAGGGTTGCACACAACTTTATCAGGATTACGTCGTTATACCAATTATTCGGTGACCGTATGTGCTTTTACTGCTGCGGGTGACGGCGTTAGGGCTGCTCCCGTTTATTGCCATACTGAGGAAGATGTTCCATCTGCGCCCGCGGATATAAAAGCTGTTGTATCATCGCGTAATAAGATCTTGGTGTCGTGGCTACCCCCGACTTCACCTAACGGGGTTTTAGTCGGTTACACCCTTTATATGAGCGTCATAGAAGATGGGAGGGAGGAGGGAACTCACAAGCGTATGTTGTCCCCACGTACATTATCACATGAGACGTCTCGATCGCCTCCACGGGCCATTCATGAGTTCTGGGTGTCCGCCTCCACCAGACTCGGAGAAGGTGATTCTACGAGGGTTGTCAAAGTACTGCCGTCAGACACCGTTCCAGCAAGAATTACTTCATTTAGTCGAAGTATCGTCACACCCTGGAAGGAAAATATATCTT
TGGCTTGCAACAAAGTTGGTGTTCCAATACCATCGACAACATGGAGAATGAATGGTGCAATTCTAGAATCGACCTTGAGAAAAAATGTTACCAGTGATGGCACATTGATAATTAGCATGACACAATACGCTGACAGCGGAAATTACACGTGCCTAGTTGAAAATACTCATGGTCGTGACGAGGTGACTTACGGTGTCGAAGTGAAGGTCCCTCCTCAACCGCCCGTGTTAGCTGTGGTCGACTCGTACGCTGATTCACTACACTTACAATGGAGTGATCAGGGGGACGGTGGCAGCCCTATTTTAGGATACGTTATAAACTACAAACGGGAGCACGGGGACTGGGAGGAGCTTCAAGTGGAGGCGGGTACATCTGAGCACGTGCTGCCAAACCTTTGGTGCGGAACCAGATACCAGCTATATATCACTGCCTTCAACCGCATCGGTACTGGCCTACCATGCGATATCGTGCATGCATATACCAAGGGCACCGTGCCTGTGAAACCGAAGCACTCCCAGATGATAACTCTCAACACGACAACTGTGACGGTGTGGCTCGACTCGTGGGGCGACGGAGGCTGCGGGATATTGTACTTCGTTATAGAGTACCGGGAGATAAGTCAGTCGCAGTGGAGATTAGTCTCGAACAGTGTGCAAGCAACAGAAAGAGTATTCAGTGTAACAGGACTGTCACCCGCCACGCACTACCAGCTACGGATCACAGCCCATAACAACGCGGGTCACGCTTTAGCACTTTATAATTTTACTACACATTCGTTGAGTGGAATGTTGGACGGTGAGGTGTCCCCGGCAGTGCCGGCGGCTGGTCCACGTTCTCTGGGTGGTGTACGTGTTTTGCTGCCGGCGGCGCTATCCCTGCTGGTGCTGGCCGCCCTTGTAGCCATAGTCTTATTAATTAGAAGGAACAAAGCAGGCCCAATAGAGACAACTGCACCTGAGACAGGAGTGGGCGAGTCACCGTCCGTGGCCCAGCTGCAGAACAAAGTGAATCGCGACCAGCAATACCTCGCCACCCGCGCCCAGCACCCACCACCACAGCACCATTACAAACACGCTTCAAATGAATACATCGAAGACATCTGCCCCTACGCTACTTTCCAACTGACCAAACCGAGCGCTTACAGCGAGAGCAGCTATAGCGGCAACGTATACAGCGGCCCCTACCACTCCGTGAGAGGATCCTTCGTATATCACGATCTCAAGCAAAATGATAAATACAAAGGAAAAGAACCAGAGTATACGAAAGTACGACGAAAAGGAAACAGACTGCGAGATCCTCACTCTGAAAGCCAAGAGTCGGACAACTTGGGATCAACGGACTCAGAGGTGAAGAAGATACTAACGCTGCACCTGCCAATTACAGAGTACGAGGACGACGCCAGCGACGACGCCAGCGCGCTGCATTCTGACAGCGAGCCCGGGGCGAGGCTACGGCCTACGCACCCCACCGCTTACCCCAGCGTCGAGCGAGAGGAGTCGTCATCGTCCTCGGAGAATTCGATGGGCGGAGTGGTGCGAAAGGCATTTCCCTCGCGCAAGGGCAAGGCTGGCGTCGGCAAGAGACACGTGCGCTCTTCCAGCGGATACAGCAGCCACAACGATGAGACAACCTTCAGTATATCGAACTACCCGTCGTTCAACGAGCACATCCACCCACCGTCGCGGTTTTCAGACGACACCGACTGCGGCCGCAAACGCGTCCACGCACACGCAGACAAGATGACACGAGAAGCCTTCCAGATTAACGTCTGA

Protein sequence:

>DPOGS207502-PA
MFSTNFVFMTPTQDYQLDWLFSDNSPVTSLAGLRRVLANGSLELPPFPADRYNRDVHAATYRCRLRLSSGYVLSKNIHVHAVLNIPWEVRVPDEYVMAGNSAILRCTVTPRCADRIEYTDWLTDDDTRILDYLGSKYSRLDDGSLYISEVRRGERYSAFRCRVRDRLTGSVYSSQYFGHIIVTEPKGGVRPRISVEPRLRRVPVGQDIRLPCAAHAWPVPSYRWFREHHEELTPIEQTLLWPRVTVMKGLLIIQRVQREDSGRFICWLNNTVGVESAHFTLIVTEPISVRIKPELLKTKRNVDVNFDCKVGGHPIEIVYWTHDGKVVKNSERVRVSEDGLRLHIRNTQQNDQGMYQCFASNTRDQAYGITEYVIDAADSRSRSVGNRSRRAPYARSTHTHDTDWNFTGTRESKPELVYWFSEQTLQPGPSVSLKCVAMGHPPPQFTWLLDGFPIPTNSRFVVGQHVTLQDDVVSHLNISRVTEQDGGEYACVASNSAGKAIHAARVNVYGLPFIREMPKVTAVAGSDLNIKCPVAGYPIESITWEREGQTLPLNRRQKVFPNGTLIVEQTQRGEDAGTYTCQATNRQRHVARRDVEVQILVQPKILPIQPLTNLLREGMRAAISCQILEGDLPIAFRWEKNDQPVTSSPYAPSGIITRRMDEYSASLVIEHITSLHSGNYTCIASNVAGSERFTVPLTVNVPPRWRLEPSDVAVAAGQDVNLQCQADGYPKPNITWKKAVGNTPGEYKDFMFEGTSRVLENGSLIFERVAKDNEGHYLCEARNDIGAGLSKLIFLKVNAPARFPMKSKTVQVTAGEAAHLQCAASGDAPLEVSWRSPHHHTIAHHLDQRYTIREQVLDDGLVSELSILQTYRQDTGALTCRASNAYGQDEMLIHLVVQEVPEMPKNIRVIDQQSRSIQISWTQPYAGNSPIINYIVQYKEAPEPWPTTPQKVIVPGSVTSSSVQNLQPATSYHLRIIAENRLGQSEPSQLVQVTTTEEVPSGSPIDVRVEAKSSTELTVSWDPPQRDLWNGNILGYYVGFQELNSNSTVLSASGPGGASYTVRTVEGTGSARARTTLAGLQKHAAYAVVVQAYNSRGAGPASPPTTATTLEDVPSLPPGTLNCSALSSQSVRVTWEPPPMRGRNGVLQGYRVTYAPVTDWYGSEEAVTKQISGLHTTLSGLRRYTNYSVTVCAFTAAGDGVRAAPVYCHTEEDVPSAPADIKAVVSSRNKILVSWLPPTSPNGVLVGYTLYMSVIEDGREEGTHKRMLSPRTLSHETSRSPPRAIHEFWVSASTRLGEGDSTRVVKVLPSDTVPARITSFSRSIVTPWKENISLACNKVGVPIPSTTWRMNGAILESTLRKNVTSDGTLIISMTQYADSGNYTCLVENTHGRDEVTYGVEVKVPPQPPVLAVVDSYADSLHLQWSDQGDGGSPILGYVINYKREHGDWEELQVEAGTSEHVLPNLWCGTRYQLYITAFNRIGTGLPCDIVHAYTKGTVPVKPKHSQMITLNTTTVTVWLDSWGDGGCGILYFVIEYREISQSQWRLVSNSVQATERVFSVTGLSPATHYQLRITAHNNAGHALALYNFTTHSLSGMLDGEVSPAVPAAGPRSLGGVRVLLPAALSLLVLAALVAIVLLIRRNKAGPIETTAPETGVGESPSVAQLQNKVNRDQQYLATRAQHPPPQHHYKHASNEYIEDICPYATFQLTKPSAYSESSYSGNVYSGPYHSVRGSFVYHDLKQNDKYKGKEPEYTKVRRKGNRLRDPHSESQESDNLGSTDSEVKKILTLHLPITEYEDDASDDASALHSDSEPGARLRPTHPTAYPSVEREESSSSSENSMGGVVRKAFPSRKGKAGVGKRHVRSSSGYSSHNDETTFSISNYPSFNEHIHPPSRFSDDTDCGRKRVHAHADKMTREAFQINV-