Monarch geneset OGS2.0

DPOGS203060
TranscriptDPOGS203060-TA1329 bp
ProteinDPOGS203060-PA442 aa
Genomic positionDPSCF300392 - 2633-12856
RNAseq coverage128x (Rank: top 56%)
Annotation
HeliconiusHMEL0107491e-9890.10% 
BombyxBGIBMGA001703-TA0.074.39% 
Drosophilaklg-PA9e-15060.71% 
EBI UniRef50UniRef50_Q9VCT41e-14760.71%Klingon n=22 Tax=Neoptera RepID=Q9VCT4_DROME
NCBI RefSeqXP_002058363.11e-14860.34%GJ14353 [Drosophila virilis]
NCBI nr blastpgi|1953995122e-14760.34%GJ14353 [Drosophila virilis]
NCBI nr blastxgi|1953995123e-14459.05%GJ14353 [Drosophila virilis]
Group
KEGG pathway 
InterPro domain[228-295] IPR0137834.4e-26Immunoglobulin-like fold
[209-295] IPR0130981e-17Immunoglobulin I-set
[220-287] IPR0035983.9e-10Immunoglobulin subtype 2
[214-296] IPR0035991.2e-07Immunoglobulin subtype
[369-407] IPR0089571.1e-06Fibronectin type III domain
Orthology groupMCL17477 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203060-TA
ATGGAGTGTTGGGTGATACTAGCCGCAGCCCTCGTGACAGCGTTGGCGCAGAATAACAGAAATGATGGACCCAGATTCCTGTCTCGAGGACATTCCTTCAGAACGGTGGTCGGAGACACCCTGCTTTTGCCTTGTCAGGTGCAAAATTTGGGTTCATTAGTACTTCTGTGGAGACGAGGTCCTGCAGTGTTGACGGCTGCCAGCCTGATGGTCACAAGAGATGAAAGGTTCAGATTAGTCGATGGGTACAACCTGCAGATCACTGACGTTGGACCACAGGATGCCGGGGACTACGTATGTCAGATCTCTGATCGCATCGCCCGCGACCAGGTCCATACAGTGGAAGTACTCGTTCCTCCCAGTGTACGCGCATCCCCTGAGTCAAGACACGCGGCCGCCCGTCGCGGGGGTGCCTCCGTACTTGAGTGTCGAGCGTCCGGGAACCCCGTGCCCAGTGTGATATGGCATAAGATGAACGACACCAGCACGCGTCTAGCGGAAGGGCCTCAATTGCAGCTGTCCCGCCTCGAGAGACAACACAGCGGCAAATACATCTGCACAGTCGACAATGGCGTGGGACCTCCCATCGTCGCAGAATTCCAGTTGCAAGTTTTGTATCCCCCGGAGATAACAGTGGACCGCTCGTGGGTCCACACCGGAGAGGGATTCCGTGCTGAATTAAGATGTTCCGTCCTCGCAGATCCACCGGCAGAGGTTCTCTGGTATCAGAATTCTTTCCCTCTCTCAGCGTCCGAGCGTATCACGATGTCCCTTCGCGGTAACAATCACACACTGCTCATAGCCAATGTACAACCAGAGGATTTCGGGAATTACACGTGCGTAGCTGATAACAGTTTGGGTCGCGCGCGCCAGCATGTGGAGGTTTCCGGTAGACCGGGAGCCGCTCGGTTCACATCCTCTCCTCTCGCCAGCTCGAGGACCTCTTACACACTGGCCTTCACCGTCGACAGCTACCCTCCGTTGGACGAGCTCCGACTGCTCTATAGACAACTTGCGATAAACGAGTCATTCCAGCAGCCTGGACGCTGGCATGACGTTGTCCTGCCGCCTCCCTCCCGCCCCACCGCTCTAGCACACCACGTCACCCACGAACTAGTTGGACTTCAGCCTGGAGCTGTCTACGAAGCGATCGTACAAGCGAAAAACAGATACGGCTGGAACGAGGTCAGCGACATATTCCAGTTCCACACTCTCGGCGGATCTCACTCCGTTCACGCGGACGAACTGTCACCTATGTTCTCTTCCGCGGCCATACACCGACTGATGACGTCAGCGGTCGCGTTAGCTTTATATACCCTGCTCGGATGA

Protein sequence:

>DPOGS203060-PA
MECWVILAAALVTALAQNNRNDGPRFLSRGHSFRTVVGDTLLLPCQVQNLGSLVLLWRRGPAVLTAASLMVTRDERFRLVDGYNLQITDVGPQDAGDYVCQISDRIARDQVHTVEVLVPPSVRASPESRHAAARRGGASVLECRASGNPVPSVIWHKMNDTSTRLAEGPQLQLSRLERQHSGKYICTVDNGVGPPIVAEFQLQVLYPPEITVDRSWVHTGEGFRAELRCSVLADPPAEVLWYQNSFPLSASERITMSLRGNNHTLLIANVQPEDFGNYTCVADNSLGRARQHVEVSGRPGAARFTSSPLASSRTSYTLAFTVDSYPPLDELRLLYRQLAINESFQQPGRWHDVVLPPPSRPTALAHHVTHELVGLQPGAVYEAIVQAKNRYGWNEVSDIFQFHTLGGSHSVHADELSPMFSSAAIHRLMTSAVALALYTLLG-