Monarch geneset OGS2.0

DPOGS211182
TranscriptDPOGS211182-TA2238 bp
ProteinDPOGS211182-PA745 aa
Genomic positionDPSCF300007 + 474291-482518
RNAseq coverage537x (Rank: top 23%)
Annotation
HeliconiusHMEL0124240.085.41% 
BombyxBGIBMGA003171-TA0.077.06% 
DrosophilaTg-PA0.046.49% 
EBI UniRef50UniRef50_P521830.053.04%Annulin n=29 Tax=Pancrustacea RepID=ANNU_SCHAM
NCBI RefSeqXP_972710.10.052.68%PREDICTED: similar to annulin [Tribolium castaneum]
NCBI nr blastpgi|910829210.052.68%PREDICTED: similar to annulin [Tribolium castaneum]
NCBI nr blastxgi|910829210.052.62%PREDICTED: similar to annulin [Tribolium castaneum]
Group
Gene OntologyGO:00181491e-27peptide cross-linking
GO:00038101e-27protein-glutamine gamma-glutamyltransferase activity
KEGG pathway 
InterPro domain[48-738] IPR0236082.6e-291Protein-glutamine gamma-glutamyltransferase, eukaryota
[42-206] IPR0137838.3e-36Immunoglobulin-like fold
[43-204] IPR0147561e-31Immunoglobulin E-set
[639-739] IPR0089581e-27Transglutaminase, C-terminal
[324-408] IPR0029312.5e-26Transglutaminase-like
[53-180] IPR0011023.9e-26Transglutaminase, N-terminal
Orthology groupMCL14731 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211182-TA
ATGGGTGCTGTGAAGAGTAAGTTGGCTGGCTGCTGTCCGCCTAGATGTGTTTGCTGCGGGTACCACCGGTCAGATTCCTACGATTTAAAAGATCTGCCGAGGCCTCCACGGTTGGATGATTCAAATAATGTCATCGACGGAACCAGACAGGGCGTCCTCGCGGTCCAAGGTATCGACTTTTGCATCGACAGCAATGGGGAAAATCATCACACTAATAAATTCAATCTCATGACAAGAGATGTTGACAGGTGTCTCGTGGTGAGACGAGGTCAGGCTTTTAAATTGGATATCTTATTGAACAGGCCATACGACGCGAAAAGAGATGCCGTGTCTTTCATCTTTTACGTCGAAGATGTGGAGAAGCGTGGGCCGTCGAATGAAGCGTCAGCGGCTGTTCCTATGTTGGAAAAGGGGTCGGAAACTTTAGGTTCGTGGAACGCTGTGTATGAAGGACAAATTGATTCGCATCTGATGGTAGCTGTGACACCAGCGGCTGACTGTATTGTGGCTGCCTGGCGATTGGATGTTGATACTAAGCTCAGCGGTACCGGAGCGCTTAGCTACACACATCCACGACCAATTTATATCATTTTTAATCCTTGGTGTCTTAACGACAGCGTGTATATGCCAGGACACAGTCACCGCGAGGAATATGTTCAAGAGGATGGTGGTCTTGTATTTAGAGGTGTTTATAACAGAATTAAACCGGCTCCGTGGAACTATGCCCAGTACGAGAAAGATATTTTAGAGTGCGCTTTGTATCTTATAAGAGAAGTTGGAAAGGTCAAAGGAGGGGCGAGAGGAGATCCAATTAGGATCGTCCGAGCGTTGGCCGCCGCGGTAAATGTTCAAGACGACAATGGTGTTTTGGTTGGTAACTGGGCCAAAGAACTGAGTGATTACAGTGGCGGAACTCATCCGCTGAAATGGGTCGGATCTTTGGCGATAATACAGAAGTACTACGAGAAGAAAAAGCCGGTTAAATACGCTCAGTGTTGGGTGTACGCTGGGGTTTTGACGACATTATGTAGAGCGCTTGGTATCCCATGCCGACCGGTGTCAGGTTACGATGCAGCTCATGACAGTCAGGGTAGTCTCACTATTGACATCATTAAAGACGAAAACGGCGACACCCTGGAAGAGTTTACCAACGACTCGGTATGGATGGACCGTCCAGATTTGGGACCTGATTATGGCGGCTGGCAGGCCATTGATGCGACGCCTCAGGAAACTTCTGAGGACGTATACCGCTGTGGCCCTGCTTCCTTGAGGGCTGTCCGCGACGGAGAACTCCAGAAACCATATGACGTGTCTTACGTCTTCGCACAAGTTAATGCAGATAAGGTTCTTTGGAAATACTCCGGTGAAATACAACCATTAAAACTTCTAGCACGTGATACGATATCAATAGGACAGAATATATCTACAAAAGCAATTGGGCGAATGGAAAGAGAGGATATAACAAATCTCTACAAATATCCGGAACGAACAAAAGAAGAGCGTGACACAATGGAGAAGGCTCTCCGTAAATCTGAGAGCATTTTCGCAAGATATTATCTAAATGACGCATTTAATGATGTCACCTTCGACTTCGAATTGAGAGACGATATTAAGATAGGACAGGATTTCAACGTTGCACTACATATAAAGAACCGATCTACAATCAACGAGCATCATGTGAAGGGTGTACTGCGAGTTGACACTGTGACGTACACTGGCGTCACCGGGGACGGGGTTAAGAGGCATGAGTTTGACATGAAGATGGCACCAGAAAAGAAAGAAACCATCACACTGCCTGTCACGTTCAATGATTATTTATTCTTTAACCAACTAGCGTCCTTCAACATAGCTTGTTTGGCGTCGATAGTCGACAGGAACTTTGACTATTTCGCACAAGACGACTTCAGAGTTCGCAATCCTGACATTAAGATCTCGATCGACGGCAAACCTATTTCACGACAGGAATTCACCGTAAACGTGAAATTGGAAAATCCTCTTCCGATACCATTAAAGAGCGGGAAATTCTATATTCAAGGACCAGGATTGGATAAACAACTTAAGATCGAACTAAGCGAGAACGTGCCACCAGGCGAGTTTGCGACAGCCCAGTTTCAGTTGACCCCTCCTTGGGCTGGTCGTCATCAGATATCAGCGAAGTTCTCCTCCAAGGAAATGCATGACGTCGACGGTTTCCTGTCGATAATGGTAGCACCACCTGCGTCTAATGGTGTTTCCTAG

Protein sequence:

>DPOGS211182-PA
MGAVKSKLAGCCPPRCVCCGYHRSDSYDLKDLPRPPRLDDSNNVIDGTRQGVLAVQGIDFCIDSNGENHHTNKFNLMTRDVDRCLVVRRGQAFKLDILLNRPYDAKRDAVSFIFYVEDVEKRGPSNEASAAVPMLEKGSETLGSWNAVYEGQIDSHLMVAVTPAADCIVAAWRLDVDTKLSGTGALSYTHPRPIYIIFNPWCLNDSVYMPGHSHREEYVQEDGGLVFRGVYNRIKPAPWNYAQYEKDILECALYLIREVGKVKGGARGDPIRIVRALAAAVNVQDDNGVLVGNWAKELSDYSGGTHPLKWVGSLAIIQKYYEKKKPVKYAQCWVYAGVLTTLCRALGIPCRPVSGYDAAHDSQGSLTIDIIKDENGDTLEEFTNDSVWMDRPDLGPDYGGWQAIDATPQETSEDVYRCGPASLRAVRDGELQKPYDVSYVFAQVNADKVLWKYSGEIQPLKLLARDTISIGQNISTKAIGRMEREDITNLYKYPERTKEERDTMEKALRKSESIFARYYLNDAFNDVTFDFELRDDIKIGQDFNVALHIKNRSTINEHHVKGVLRVDTVTYTGVTGDGVKRHEFDMKMAPEKKETITLPVTFNDYLFFNQLASFNIACLASIVDRNFDYFAQDDFRVRNPDIKISIDGKPISRQEFTVNVKLENPLPIPLKSGKFYIQGPGLDKQLKIELSENVPPGEFATAQFQLTPPWAGRHQISAKFSSKEMHDVDGFLSIMVAPPASNGVS-