Monarch geneset OGS2.0

DPOGS210680
TranscriptDPOGS210680-TA1389 bp
ProteinDPOGS210680-PA462 aa
Genomic positionDPSCF300013 - 1054637-1061130
RNAseq coverage80x (Rank: top 64%)
Annotation
HeliconiusHMEL0174745e-9673.02% 
BombyxBGIBMGA006297-TA2e-10376.25% 
Drosophilakal-1-PA2e-4028.30% 
EBI UniRef50UniRef50_Q8WS933e-17263.54%KAL-1 n=2 Tax=Obtectomera RepID=Q8WS93_BOMMO
NCBI RefSeqNP_001037043.16e-17363.54%Kal-1 protein [Bombyx mori]
NCBI nr blastpgi|1129831201e-17163.54%Kal-1 protein precursor [Bombyx mori]
NCBI nr blastxgi|1129831200.064.35%Kal-1 protein precursor [Bombyx mori]
Group
Gene OntologyGO:00055153.8e-10protein binding
GO:00055761.8e-09extracellular region
GO:00304141.8e-09peptidase inhibitor activity
KEGG pathwaycfa:4788192e-08 
 K12567 (TTN)maps-> Dilated cardiomyopathy
    Hypertrophic cardiomyopathy (HCM)
InterPro domain[224-332] IPR0089577.8e-12Fibronectin type III domain
[220-336] IPR0137831.2e-10Immunoglobulin-like fold
[228-319] IPR0039613.8e-10Fibronectin, type III
[61-107] IPR0081971.8e-09Whey acidic protein, 4-disulphide core
Orthology groupMCL16388 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210680-TA
ATGTGGATGATAAAAACTGGTGTGATAATACTGGCTGTTCTGATATCAGCATCGGCTAAATCAAAAAGGTACACTAGACTGCAGAGCGATCCCTTGACAACAACGAGATGTGACCTTATATGTTTTGATGCGAGCAAAGAAAATAAATCTCAGTGTCGATCAGCTTGTCGGTCAGAGACGCAAAAGCCAGGAACCTGTCCTGATGGAGACGATCCTCGCTGGATGGCCGCGTGCCTCGAAGCCTGTAATCATGACTCTCAATGCGACGGCACTCAAAGATGTTGCCAGCATGGATGCAGTTCCACGTGCAGTGAGCCCACCGATTTGTTGACTATACCGGGCCTTCCAGCGATGCCGTCAATAGAGGAACCAAAAGAAAGACGGCGCGCAGTTCAGATTAAATGGTCAGATGGTGTAGGTGATGAAGCAAGATCTGTTCCAGGTAGAGTTCTTTACCTATTGGAAGAACAACATTATCTTTGCCCCAACTACGATGAATCACGACTTGGAGAGTGGAATCTCCTGATGAGAACCAATAAAACCAAAGTGTCTCTACGTAACCAGTTGAAACCAGGTCGTTGGTATCGTTTCCGAGTGGCTGCTGTAAGTGCGTCTGGTACAAGAGGGTTCTCCGAACCTAGCGCTCCCTTCACTCCTCGTAAAGGACCACGCCCTCCACCCCCGCCAAAGAAGCTAAAGGTGGAACATGTGAGATCAGACAATGATAGTGTTACAATACGACTGGAATGGAAAGAGCCAAAATCTGATTTACCAGTGATGAGATATAAAGTTTTTTGGAGCAGACGACTTCGAGGTCTCTCAGGGGAGTTGGATTCTGTTGTCGTTAATCATCAAACTGTGCCAAAGGATCAGACTTTCGTTGAAATAAGTAAACTTCATCCGAATTCAATGTATTTCCTCCAAGTACAAACAATAAGTGCATTTGGTGGTGGAAAACTACGAAGTGAAAAGGCTGAAATTTTTTATAACACAACGAGTTCTGAACAGCCACCACAGGCATTAAAAAGGCGTATAGACAACTCCGTAACAGGACTCAGATTAAACAAACTTATATGGTTGAACCATAAGATTAAGGCTAAAATATCATGGGAATTGCCTCCAGGCTCAAAGGGACAATCTAAAAGATATTTTGTGCACTGGAAAACTCTGTCCTGCCAACATCCAGCAACAGAATTAAAGGAATTTTCAGCAATAACCGAGCAAAACAGCTTCGAAATATATGAGTTAGATTACAAATGCAAATACAAAGTAAACGTGAACAGATCTCCGAACAGCGTTACTCCAGACTCCGAATACATTTTATCAGTTCCTGGATGCGATTATTTTAAACGGAAATTTAATAGCTCCTACGTTAAATGTAAAACATAG

Protein sequence:

>DPOGS210680-PA
MWMIKTGVIILAVLISASAKSKRYTRLQSDPLTTTRCDLICFDASKENKSQCRSACRSETQKPGTCPDGDDPRWMAACLEACNHDSQCDGTQRCCQHGCSSTCSEPTDLLTIPGLPAMPSIEEPKERRRAVQIKWSDGVGDEARSVPGRVLYLLEEQHYLCPNYDESRLGEWNLLMRTNKTKVSLRNQLKPGRWYRFRVAAVSASGTRGFSEPSAPFTPRKGPRPPPPPKKLKVEHVRSDNDSVTIRLEWKEPKSDLPVMRYKVFWSRRLRGLSGELDSVVVNHQTVPKDQTFVEISKLHPNSMYFLQVQTISAFGGGKLRSEKAEIFYNTTSSEQPPQALKRRIDNSVTGLRLNKLIWLNHKIKAKISWELPPGSKGQSKRYFVHWKTLSCQHPATELKEFSAITEQNSFEIYELDYKCKYKVNVNRSPNSVTPDSEYILSVPGCDYFKRKFNSSYVKCKT-