Monarch geneset OGS2.0

DPOGS203980
TranscriptDPOGS203980-TA1890 bp
ProteinDPOGS203980-PA629 aa
Genomic positionDPSCF300005 + 1070628-1075696
RNAseq coverage8x (Rank: top 85%)
Annotation
HeliconiusHMEL0135260.067.36% 
BombyxBGIBMGA002119-TA0.068.45% 
Drosophilaside-PA8e-3632.94% 
EBI UniRef50UniRef50_B3LXK46e-3432.36%HDAC4 n=3 Tax=Diptera RepID=B3LXK4_DROAN
NCBI RefSeqXP_394052.36e-3729.81%PREDICTED: similar to CG12950-PA [Apis mellifera]
NCBI nr blastpgi|3800163821e-3629.86%PREDICTED: hemicentin-1-like [Apis florea]
NCBI nr blastxgi|3800163827e-3629.86%PREDICTED: hemicentin-1-like [Apis florea]
Group
KEGG pathway 
InterPro domain[219-320] IPR0137831.2e-13Immunoglobulin-like fold
[237-305] IPR0131621.8e-08CD80-like, immunoglobulin C2-set
Orthology groupMCL26562 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203980-TA
ATGTTAAAAGAGGTTGTTGTCCTTCGAGCCGTTCGTGGTAGAGATGCGCGGCTTCCTTGTGGCCAAGGAAAGGTGTTTTTAGATGGACCAGACTCATATGTGCTGTGGCTTAAAAATGACAGAGATTTCCTATACAGATTTCCCAACGAAGAAACTGAACGGAGAGTCGTCAACAGGGATAGGATATTTGGAACTTCATGTGAAGAAGGCACATGTTATGATGATACCTCTCTTATACTAAAGCGAGTAAGTGAACAAGACGGTGGCATTTATCGATGTAGAGTACATTACCAAGCATCTCCATCTCTGGACTATGTAATCGACGTTAGGTTAGTCGATTCTCCAGGCATCCCTAGAATTTTTAATGAAGGAGGTGAAGAAATAAAAAGAGGCTACGTAGGTCCACTCAGTTTGGGTTCGAATATGTCTTTGATTTGCGAAGTTGATGATGATCAACCCGATACACTTGTGTACTGGCGTCGTAACGGTATACTGATCGAGCGGGCGCACACAGTTCGAACAGGCGTGTTGCGTGCTGCCGTTCGTGTGCGCAACGCAACGCGAGACGAACTCGATGCACACTATGAATGTTTGGCACAAAATGCCGATGTCACCGAACCATTGAGTGCCAGCGTCGTGGTTAAGATGTACCTTCCGCCACTTAGCGTTGAAATACGTCTCAACAACAATTTTGACTTCGAAGCCGGTCATCCAAGAGTCGTAGACTGCGTGATTGTTGGTTGCGTACCACCACCCGCAATTACGTGGCACCTCGGGGAACAATTACTTCGGCCTAATGTGCATAAAGAACTTCATGATGGCAACTATACGGTGTCTTCTCTGACGTTGTCGCCTTCTTTGCGCGACTCCAAGCATGATTTAGTTTGTCGAGCTCACAATTCACATTTACCAACCTCGGTGTTCGAAGATAAAGTTACTTTAAACGTTGGATATCGTCCTTTGTGTCTAAATAATCGAGAAGAGACCGTTGGAGCGCTGGTTCAGGACGCAGAGACAGTTAGCTGTGTAGTGGAAGCGTCGCCTGAGCCGCTACAGTTTAGCTGGACCTTCGCTGACTCCCGAGTATTATACACCAGTGTGAAGAAGGTATCAGGGCATCACAATAGGTATTCATCAACACTAACATGGCTTCCAAGAGAAACAGATATCGGCCTTCTTCTATGTCGAGCAACAAATTCATTTGGAGAACAAAAACGACCATGCTCGTACAGCATAGTACCTGGTGGTCCACCGCACGCCCCTGAGTGTGTCGTATTGAGATCTTACCCCCATTCCGTACGAGTACAATGTCAGAAAGGTTGGGATGGCGGCAGACAACAAACCATCCATTTGGAACTCCTAGGCATGGACGGTACGGTCTTCCACAATATATCTAATGCAGCTGGGCATTTCCTTTTACCTGATATTGAAGACGACAGAAACTTGACCGCACTTGTGTATGCAAGCAGTCCAAGAGGAAGAAGCAAAACAAGAACTGTTCATTTACAAACAATAACTCCTCTTAGTCCTTCAGCTGTGGCAGAAACAGTTCCTCTTTATTCCTGGACGCGGTGGATGGAAGTGACCGCAGGGGCGGTAGCCGTCACCGCAGCCATTGGAGCAGCTATTATTTGTATCAAACTTATGAAGACCAAACGAGAGAACATGGAAATGAATCCGGATTTAGTTTCTAGAACTGATGGTTCATTCCATCGAGAGCCAGACTTTGCACGTGAAGAGCGGGTACTGGGGACAACTAATATAACCGTTAACCAGTTAGCTTCTTGTCAGAACCAATATTCCGCTTCACCAGTGCCAGCTTGCCGCGTGCTGGTTGGATGCGCGGCTCAAGACTCTTGTCCAGCATCACAGCATTCTTACTATGTCTAA

Protein sequence:

>DPOGS203980-PA
MLKEVVVLRAVRGRDARLPCGQGKVFLDGPDSYVLWLKNDRDFLYRFPNEETERRVVNRDRIFGTSCEEGTCYDDTSLILKRVSEQDGGIYRCRVHYQASPSLDYVIDVRLVDSPGIPRIFNEGGEEIKRGYVGPLSLGSNMSLICEVDDDQPDTLVYWRRNGILIERAHTVRTGVLRAAVRVRNATRDELDAHYECLAQNADVTEPLSASVVVKMYLPPLSVEIRLNNNFDFEAGHPRVVDCVIVGCVPPPAITWHLGEQLLRPNVHKELHDGNYTVSSLTLSPSLRDSKHDLVCRAHNSHLPTSVFEDKVTLNVGYRPLCLNNREETVGALVQDAETVSCVVEASPEPLQFSWTFADSRVLYTSVKKVSGHHNRYSSTLTWLPRETDIGLLLCRATNSFGEQKRPCSYSIVPGGPPHAPECVVLRSYPHSVRVQCQKGWDGGRQQTIHLELLGMDGTVFHNISNAAGHFLLPDIEDDRNLTALVYASSPRGRSKTRTVHLQTITPLSPSAVAETVPLYSWTRWMEVTAGAVAVTAAIGAAIICIKLMKTKRENMEMNPDLVSRTDGSFHREPDFAREERVLGTTNITVNQLASCQNQYSASPVPACRVLVGCAAQDSCPASQHSYYV-