Monarch geneset OGS2.0

DPOGS215772
Transcript        DPOGS215772-TA   1848 bp
Protein           DPOGS215772-PA   615 aa
Genomic position  DPSCF300041 + 1729682-1737632
RNAseq coverage   1885x (Rank: top 7%)
Annotation
Heliconius       HMEL014096         0.0      84.88%
Bombyx           BGIBMGA003652-TA   0.0      88.96%
Drosophila       LamC-PA            2e-165   53.78%
EBI UniRef50     UniRef50_B4JAF2    1e-162   50.16%   GH10295 n=26 Tax=Endopterygota RepID=B4JAF2_DROGR
NCBI RefSeq      XP_001605883.1     0.0      61.23%   PREDICTED: similar to ENSANGP00000015219 [Nasonia vitripennis]
NCBI nr blastp   gi|156544736       0.0      61.23%   PREDICTED: lamin Dm0-like isoform 1 [Nasonia vitripennis]
NCBI nr blastx   gi|156544736       0.0      59.87%   PREDICTED: lamin Dm0-like isoform 1 [Nasonia vitripennis]
Group
Gene Ontology    GO:0005198   1.3e-168   structural molecule activity
KEGG pathway
InterPro domain  [20-615]     IPR001664   1.3e-168   Intermediate filament protein
                 [55-404]     IPR016044   1.3e-69    Filament
                 [464-572]    IPR001322   7e-31      Intermediate filament, C-terminal
Orthology group  MCL10482     Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215772-TA
ATGTCATCAAAATCAAAAAAGACTGTAACATCATCGACCAATATAAGTGTTGTGTCTTCAGTGCAAAGCCCTCCTCAATCAAGCACTCCTGTTGGTAGCCGCCCTTCTAGTTCAGCCGGTCGCCCCAATAGCCCGTTAAGTCCAACAAGACATACTCGTCTACAAGAAAAAGATGCGTTACAAAATCTTAACGACCGCCTCGCAGCTTATATCGATAAGGTTCGCCAGCTCGAAAGTGAAAATTCAGGGTTACGCCGTGAGATACAAACAACACAGGAAGTGGTTACACGTGAGGTTTCCAACATCAAGGGAATGTATGAGCATGAATTACAAGATGCCAGAAAACTCTTAGATGATACCTCCAGAGAAAAAGCCAAATTGGAGATAGATTTGAAGAGACTGTACGAGGAAAATGATGACCTGAAAAAGCGCCTAGATAAGAAGACCAAGGATTGTCAGCAGGCTGAAAATTTGGCGCGTCACTATGAGACACGTTTCACCGAGGAGAGCAACAAATATAACACAGCACTCGCAGACAAGAAGAAGGCTCAAGATGAGGCCAGGGATCTCGCCAAGGAGTTGGAGAAATTGCGGAAGGTGTACGCTGAAGCGCGCAAGACCTTAGAAGATGAGATGCTGTGTCGCATTGATATGGAGAATACGGTGCAGAGCCTCAGAGAGGAGCTGTCCTTCAAGGAACAGGTGTTCCAGCAGGAGATGCAGGAGACGCGCACCAGGAGACAGGTGGAGATCTCCGAGATCGACGGACGCCTGGCCCAGCAGTATGAGGCGAAACTGCAGCAGAGTCTTCAGGAACTACGCGAGCAACAGGAAGCTAACATCAAGGCGAATCGTGATGAAATAGAGGCCTTATACGAGAACAAGTTGAAGAATCTACAATCGGCGGCTACTCGTAACAACACAGCTGCGACCGTGGCCGTCGACGAGCTCAGGACCATGCGGACAAGGATCGACTCGCTGAACAGTACCCTCAACGATCTGGAGAACAAGAACGCTGCTCTCAGCAACCGCTGCCGCGAACTCGAACGCCAACTAGAATCCGAGCGAGCTCGTCACGCCGAGGACCTGGCGTCCCTGGAACAAGAACTGGCGCGGCTGAGAGACGAGATGGCGTCCCAGCTCAGGGAGTACGCCACACTCATGGACATCAAGTGCTCGCTGGACCACGAGATCGCCACCTACCGCGCGCTCAAGCTCCCGCTCCTCCTGTCCATGTTGAACCTCACGTCTCAGTCCCCCGGCCGTGAGTCCCGTGCGTCCATGAGCGCCTCAGGTAGTGGTCGCGTCACACCAGGTCGGCGGGCGACTCCTCTCCGCGCCGCTCGTAAGCGCACCTTGCTAGATGAAAGTGAAGAACGCAGCCTTCAAGACTTCAGCGTCACGTCCAGCGCTAAAGGAGACCTGGAGGTCGCAGAGGCCTGCCCAGAGGGAGCGTTCGTCAAGATCCGCAATAAAGGAAAGAAGGAGCTGAGCTTGGGCGGATACCAGATCCTTCGCAAGGCTGGTGACCAGGAGACTCTGTTCAAGTTCCACCGAACAGTGAAGTTGGAGCCGGGCGCGGTTAGTACGGTGTGGTCGGCGGACGTGGGAGCCCATCACGACCCGCCCACCAGCATCGTTATGAAGGAACAGAAGTGGTTCGTGGCCGACTCGTTCGTTACATCGTTACTTAATAATGATGGAGAAGAGGTAGCGGTGTCCGAGAGACAACGACGACAGATAAGTACAAGTGCGCAGCGACACCGGGAACTAGCACACAAGTTCCCACGTCGCGAACAGCAACTGGGCGAAATTCGTGAAGGTGAAGAGAATTGTCGTATTATGTAA

Protein sequence:

>DPOGS215772-PA
MSSKSKKTVTSSTNISVVSSVQSPPQSSTPVGSRPSSSAGRPNSPLSPTRHTRLQEKDALQNLNDRLAAYIDKVRQLESENSGLRREIQTTQEVVTREVSNIKGMYEHELQDARKLLDDTSREKAKLEIDLKRLYEENDDLKKRLDKKTKDCQQAENLARHYETRFTEESNKYNTALADKKKAQDEARDLAKELEKLRKVYAEARKTLEDEMLCRIDMENTVQSLREELSFKEQVFQQEMQETRTRRQVEISEIDGRLAQQYEAKLQQSLQELREQQEANIKANRDEIEALYENKLKNLQSAATRNNTAATVAVDELRTMRTRIDSLNSTLNDLENKNAALSNRCRELERQLESERARHAEDLASLEQELARLRDEMASQLREYATLMDIKCSLDHEIATYRALKLPLLLSMLNLTSQSPGRESRASMSASGSGRVTPGRRATPLRAARKRTLLDESEERSLQDFSVTSSAKGDLEVAEACPEGAFVKIRNKGKKELSLGGYQILRKAGDQETLFKFHRTVKLEPGAVSTVWSADVGAHHDPPTSIVMKEQKWFVADSFVTSLLNNDGEEVAVSERQRRQISTSAQRHRELAHKFPRREQQLGEIREGEENCRIM-
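
The transcript and protein lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database: the function name `expected_protein_length` is hypothetical, and it assumes the transcript entry is a complete coding sequence that runs from the start codon through a single terminal stop codon (the nucleotide sequence above begins with ATG and ends with TAA, and the protein sequence ends with "-").

```python
def expected_protein_length(transcript_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumes the sequence is a complete open reading frame with no UTRs.
    """
    if transcript_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = transcript_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above: 1848 bp transcript, 615 aa protein.
print(expected_protein_length(1848))  # 615
```

Under that assumption, 1848 bp gives 616 codons, and dropping the stop codon gives 615 residues, which agrees with the protein length listed for DPOGS215772-PA.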