Monarch geneset OGS2.0

DPOGS214314
TranscriptDPOGS214314-TA2016 bp
ProteinDPOGS214314-PA671 aa
Genomic positionDPSCF300020 - 871256-878654
RNAseq coverage226x (Rank: top 44%)
Annotation
HeliconiusHMEL0222708e-13351.45% 
BombyxBGIBMGA004126-TA0.063.86% 
DrosophilaCG8679-PA5e-5838.18% 
EBI UniRef50UniRef50_D6WTD76e-10337.82%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WTD7_TRICA
NCBI RefSeqXP_973306.22e-10337.30%PREDICTED: similar to CG8679 CG8679-PA [Tribolium castaneum]
NCBI nr blastpgi|2700102542e-10237.82%hypothetical protein TcasGA2_TC009633 [Tribolium castaneum]
NCBI nr blastxgi|2700102545e-10637.65%hypothetical protein TcasGA2_TC009633 [Tribolium castaneum]
Group
Gene OntologyGO:00056351.1e-10nuclear envelope
KEGG pathway 
InterPro domain[5-133] IPR0206831.2e-20Ankyrin repeat-containing domain
[414-450] IPR0110151.1e-11LEM-like domain
[416-449] IPR0038871.1e-10Lamino-associated polypeptide 2/emerin
Orthology groupMCL13211 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214314-TA
ATGGATACGACGTTTAAAGAGTTATTCAAACTTTATGATATTATACAAAATGATGATTCAAGATCAGTCGAAAGATTTTTAAATACGCAGAAAGTAAACATTAACTTAGTTTTACCCTGTAAAGGTATTGGAGTGTTACATTTAGCTGTAGGTATAGAACCTTTGGATAAATCCAGAAATTGCACAGAAATATTATTGAAGCATGGCGGCGATCCTAATCTAAGTAATGACGATGGAATAACCCCAGTCCATATAGCAGCAATATGGGGTAGAGTGGATAATTTGAAGCTGCTTGTTGGCTGCGGCGGAGATCCGTCAAGACGTGATCTGGACGGTATCTCAGCATTTGATTATGCGGTGAGAGAACAACAGTGGGATGTTTACGATTACTTACATAATGTTATTGACAAAAGTGATTCCACTGATCCCATTGAAACGGAGTGTGCGTACACACTTAACATCGAACGGGTCTTAGTTACAACAGACCACGTGGTGGCGGAATATGAACCACTAAAAGAAAACCAGACAGATATAATCAGTAGAAAATCTGATACTATCAGAGATTGGTTGGATAGGAACAGCTTGGTCATAAATAAATTATATCCCACCATACATGGGGAGACGTACCTCATAGAGGACGAATCCGAACCGTGGACTGACGACACCAATAACACTTTCATGACCTGTTGTTCAAAACGCGAAGTGGAAATAACCGGGGTGCATCCCCTGGAACTGACTTCCGACAAGAGCAAGTTCTCAAACGAGAGCAAACAGAGAGCTAGCGTGTATGAAAATTTTTCCCTACCAGGCTCACCCAGTTGCATCACGCACAGAAACTATAACAGTAAAACCAGCCTCAGGAAATTGGGTTCCAAAGAGCTGATCGCTACCAACCAACTGACATTAGACAGTCATATAACTGATGACGACTCTCAAAGTGATCTGGACAAAGCGATGATTATGGTCCAGAATCGCACCAACGAATGGTTGTGCGGTGACGATAGGAACAGGAATACTAGGCGATCATCAGCCAGTAGCGGTGTTAGCAGCACTCCATCTGGCAACATAATGCTGGCTAACGTCCACGAGGAATACAAATACGAGGACCCTGAAGAAGACGTCGTCCTCATAGAGAGGAGACTACAAGTTTCCTCGATCGTACTGCCGGTAGACGAAGGGTCAGACGCCGATCAAACGATACATTCAATAGCCTCGTCTTTGCCATTTTCCGTTTCCTACGACAACGACGCTCTAAGATCCGAACTCATACGCCTAGGAGTTACCCCGGGTCCCATACAGAATACAACAAGGACGCTGTATTTGAAAAAACTCCAAGCCTTAAAAACGAATCCTGTAATCGCTGAGCATCAAGAAACTGGTAAAACTTACAGTGTAGAACTAACTAAATCACTCCGCAATGATGATTGGTTACAAAATTTGTCTCCATACATCAATATACAGACAAAAGTGCAGGCTGATTTTGAGAATCCAAAGAAGATTTGGAGGGAAGGTAACGCGAAAACTTCATTCACCTACCTACTACTGGATCCACGAGTCACAGGAAACCTCCCGGCCAGAGCCAACAGTCAAAGCCATCAGGTCTCTTGGACTACATTCATTAACTCCATCTTCTACATCGGTAAAGGTAAACGTTCAAGACCATATGCCCATTTATACCAAGCGCTGACACCGTGGAAACACAATATAAGGAAATCAAGCAAACAGAAAGTCCAACATATACTGGACATATGGTCGGACGGTTTGGGTGTAATATGTCTGCATGTGTTCCAAAACATAATTCCAGCCGAGGCGTACACATATGAGGCTGCAATGATTGACGTCATCGGACTCAATAATCTTAAGAATGAGAAAATCGGCAACTACTATGGCGCAGCCGAGCAGTTGACTAGGAAGGAAAGACGAATGCTTGGGTTATATCTCTTATACAAAGCATTGAATATTTATATCAACGAAGGTGAAAGGCAACTCAGACCGGACAATGTTAAGGGAGACTGA

Protein sequence:

>DPOGS214314-PA
MDTTFKELFKLYDIIQNDDSRSVERFLNTQKVNINLVLPCKGIGVLHLAVGIEPLDKSRNCTEILLKHGGDPNLSNDDGITPVHIAAIWGRVDNLKLLVGCGGDPSRRDLDGISAFDYAVREQQWDVYDYLHNVIDKSDSTDPIETECAYTLNIERVLVTTDHVVAEYEPLKENQTDIISRKSDTIRDWLDRNSLVINKLYPTIHGETYLIEDESEPWTDDTNNTFMTCCSKREVEITGVHPLELTSDKSKFSNESKQRASVYENFSLPGSPSCITHRNYNSKTSLRKLGSKELIATNQLTLDSHITDDDSQSDLDKAMIMVQNRTNEWLCGDDRNRNTRRSSASSGVSSTPSGNIMLANVHEEYKYEDPEEDVVLIERRLQVSSIVLPVDEGSDADQTIHSIASSLPFSVSYDNDALRSELIRLGVTPGPIQNTTRTLYLKKLQALKTNPVIAEHQETGKTYSVELTKSLRNDDWLQNLSPYINIQTKVQADFENPKKIWREGNAKTSFTYLLLDPRVTGNLPARANSQSHQVSWTTFINSIFYIGKGKRSRPYAHLYQALTPWKHNIRKSSKQKVQHILDIWSDGLGVICLHVFQNIIPAEAYTYEAAMIDVIGLNNLKNEKIGNYYGAAEQLTRKERRMLGLYLLYKALNIYINEGERQLRPDNVKGD-