Genetic

profileshanice1
Articletoanswerworksheet.pdf

Smeby et al. Oncogenesis (2019) 8:35 https://doi.org/10.1038/s41389-019-0141-3 Oncogenesis

ART ICLE Open Ac ce s s

Transcriptional and functional consequences of TP53 splice mutations in colorectal cancer Jørgen Smeby1,2,3,4, Anita Sveen1,2,4, Ina A. Eilertsen1,2,4, Stine A. Danielsen1,2, Andreas M. Hoff1,2, Peter W. Eide1,2, Bjarne Johannessen1,2,4, Merete Hektoen1,2, Rolf I. Skotheim 1,2, Marianne G. Guren2,3, Arild Nesbakken2,4,5 and Ragnhild A. Lothe1,2,4

Abstract TP53 mutations are common in colorectal cancer (CRC). Most TP53 sequencing studies have been restricted to coding regions, but recent studies have revealed that splice mutations can generate transcript variants with distinct tumorigenic and prognostic properties. Here, we performed unrestricted sequencing of all coding sequences and splice regions of TP53 in a single-hospital series of 401 primary CRCs. TP53 splice mutations were detected in 4% of the cases (N= 16), considerably more frequent than reported in major databases, and they were mutually exclusive to exon mutations. RNA sequencing revealed high-level expression of aberrant transcript variants in the majority of splice mutated tumors (75%). Most variants were predicted to produce truncated TP53 proteins, including one sample expressing the potentially oncogenic and druggable p53ψ isoform. Despite heterogeneous transcript structures, downstream transcriptional profiling revealed that TP53 splice mutations had similar effects on TP53 target gene expression and pathway activity as exonic mutations. Intriguingly, TP53 splice mutations were associated with worse 5- year relapse-free survival in stage II disease, compared to both TP53 wild-type and exon mutations (P= 0.007). These data highlight the importance of including splice regions when examining the biological and clinical consequences of TP53 mutations in CRC.

Introduction Mutations in the tumor suppressor gene TP53 have

long been known to be integral to colorectal carcino- genesis1,2, substantiated by its high mutation rate in manifest colorectal cancers (CRC)3. The TP53 mutation- spectrum and distribution show a large proportion of missense mutations clustering on hotspot codons, while truncating alterations, including nonsense, frameshift and splice site mutations, constitute ~20–25% of the muta- tions. Specifically, mutations at canonical splice sites

account for approximately 2% of all detected TP53 mutations, according to major databases4,5. However, this may be underestimated since mutation analyses have been restricted to coding regions and the DNA-binding domain in many studies. The consequences of TP53 splice mutations on the

expression of the corresponding transcript variants are diverse, with smaller studies providing evidence of alter- native splicing by exon skipping, intron retention, gen- eration of novel splice sites and usage of cryptic splice sites6. On the other hand, the effects of TP53 splice mutations on downstream target gene expression and pathway activity have not been elucidated. Intriguingly, some truncating TP53 mutations have been shown to produce transcriptionally inactive isoforms leading to activation of a prometastatic cellular program7,8. These

© The Author(s) 2019 OpenAccessThis article is licensedunder aCreativeCommonsAttribution 4.0 International License,whichpermits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if

changesweremade. The images or other third partymaterial in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to thematerial. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Correspondence: Ragnhild A. Lothe ([email protected]) 1Department of Molecular Oncology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway 2K.G. Jebsen Colorectal Cancer Research Centre, Division of Cancer Medicine, Oslo University Hospital, Oslo, Norway Full list of author information is available at the end of the article.

Oncogenesis

12 34

56 78

90 () :,;

12 34

56 78

90 () :,;

1 2 3 4 5 6 7 8 9 0 () :,;

12 34

56 78

90 () :,;

separation-of-function isoforms possess distinct onco- genic properties with potentially prognostic and ther- apeutic relevance. In a single-hospital series of 401 primary CRCs, we

investigated the downstream consequences of splice mutations in TP53, including expression of aberrant transcript variants, target genes, pathway activity and potential clinical consequences.

Results The spectrum of coding and splice mutations in TP53 TP53 was mutated in 60% (N= 241) of the 401 cases,

including 4% (N= 16) with splice mutations. The total number of mutations was 252, with double mutations in 2.7% (N= 11) of the tumors. Missense mutations were most common (69%, N= 173), while splice mutations accounted for 6% of all TP53 mutations (Fig. 1a) and were more frequent than the corresponding 2% reported in the IARC TP53 database4. Of the 16 TP53 splice mutations, one had not been previously described in cancer, and nine were novel to CRC (according to the IARC TP53 Database in April 2016; Supplementary Table S1)4. Thirteen were point mutations at the consensus splice sites, five in the donor site and eight in the acceptor site, of which 12 were single base substitutions and one was a single base dele- tion (Table 1). The three remaining splice mutations consisted of a point mutation located five basepairs downstream of exon 4, a duplication of six basepairs spanning the exon 7/intron 7 boundary and a deletion of 14 basepairs spanning the intron 5/exon 6 boundary. The majority of splice mutations (N= 13, 81%) was located adjacent to exons 5 to 8, reflecting the distribution of TP53 exon mutations (Fig. 1b). Notably, all splice muta- tions were mutually exclusive to TP53 exon mutations.

Diverse aberrant transcript variants caused by TP53 splice mutations RNA sequencing revealed altered splicing patterns

corresponding with splice mutations in 12 (75%) of the 16 mutated samples, including exon skipping events, intron retention and usage of cryptic splice acceptor and donor sites not detected in an unmatched normal colonic mucosa sample (Table 1, Fig. 2a–c, Supplementary Fig. 1). Two different aberrant transcript variants per sample, in addition to the canonical splice variant, were identified in six samples (Table 1, Supplementary Fig. 1). Disparate splicing consequences between samples with mutations affecting the same canonical splice site were also observed. Samples 8 and 16 harbored base substitutions one and two nucleotides upstream of exon 8, respectively, but only the former exhibited aberrant splicing in this region (Fig. 2b). Furthermore, the estimated expression levels of the aberrant transcript variant(s) relative to the canonical splicing isoform per sample ranged from 7 to

88%, with a median of 31% (Fig. 2a, Supplementary Table S2). Collectively, this shows diverse effects of splice mutations on expressed transcript structures. Nonetheless, most of the aberrant transcript variants

(N= 11, 85%) introduced a premature stop codon or altered reading frame, and were predicted to produce truncated TP53, if not degraded by nonsense-mediated decay (Table 1). Among these was the truncated p53ψ isoform which has previously been implicated in can- cer8. This was expressed in one sample harboring a point mutation two basepairs upstream of exon 7 (c.673–2A, sample 10, Fig. 2c), as a result of activation of a cryptic splice acceptor site located 49 basepairs upstream of the canonical splice site in intron 6. This separation-of-function isoform has been shown to have poor prognostic associations and to induce epithelial to mesenchymal transition (EMT) in a transcriptionally independent manner8.

TP53 splice mutations lead to reduced expression of TP53 target genes To analyze the downstream transcriptional con-

sequences of TP53 splice mutations relative to exon mutations, TP53 gene expression was compared according to mutation type. Samples with missense mutations had the highest TP53 expression levels, similar to the expression in tumors with TP53 wild type (Fig. 3a). Samples with frameshift and nonsense muta- tions had a significant reduction in the expression of TP53 (P < 0.001), likely because several of these tran- scripts are eliminated by nonsense-mediated mRNA decay. Samples with splice mutations had a distinct intra-group dichotomy of high and low TP53 expression, but this was not related to the expression of truncating isoforms. However, differential gene expression analysis showed that both tumors with exon mutations (irre- spective of the type) and splice mutations had down- regulation of known TP53 transcriptional targets in comparison with wild-type tumors, including MDM2, FAS, HSPA4L and SPATA18 among the top five differ- entially expressed genes (Supplementary Table 3a)9. No genes were differentially expressed between the two mutation groups. Furthermore, to compare TP53 path- way activity, a single-sample gene set expression enrichment score was calculated for a set of 200 genes involved in the pathway10. All mutation types were associated with significantly lower TP53 signature scores than wild-type samples (P < 0.01), with only minor dif- ferences among the mutation types (Fig. 3b, Supple- mentary Table 3b). Accordingly, the downstream transcriptional consequences of TP53 splice mutations were similar to those of exon mutations. No significant differences between TP53 splice and exon

mutations with respect to clinicopathological and

Smeby et al. Oncogenesis (2019) 8:35 Page 2 of 8

Oncogenesis

molecular associations were found (Supplementary Table 4) and neither splice- nor exon-mutations were associated with survival across all stages (Supplementary Fig. 2a). However, stratification according to TNM stage showed that patients with TP53 splice mutations in stage II had worse 5-year RFS (P= 0.007, Fig. 3c) compared to both TP53 wild-type and exon mutations, although the sample number was low. The same trend was also seen in mul- tivariable analysis comparing splice mutations to exon mutations (RFS: hazard ratio (HR) 4.82 (1.14–20.43); P= 0.033, Supplementary Fig. 2b). Of note, splice mutations were associated with inferior prognosis in stage II com- pared with other truncating mutation types, i.e. nonsense and frameshift mutations (P= 0.02, Fig. 3d), suggesting that the prognostic impact is not due to the loss of full- length TP53 protein per se.

Discussion We show that splice mutations account for 6% of all

TP53 mutations in CRC, which is threefold higher than estimated in major databases4,5. With respect to expres- sion of corresponding aberrant transcript variants, the splice mutations show great diversity, but on the protein level, the majority are predicted to cause truncated iso- forms, in line with the prevailing notion of the effects of such mutations. Nonetheless, all mutation types analyzed in this study, both splice mutations and the different types of exonic mutations, converged on downregulation of both TP53 target genes and signatures of TP53 pathway activity. This, together with the mutual exclusivity to exon mutations, indicate that TP53 splice mutations have similar biological relevance as alterations in protein- coding sequences.

Missense

Frameshift

Nonsense

Splice

In frame indel

Silent

0.8%1.2%

68.7%12.3%

10.7%

6.3%

TAD PRD DBD OD Neg

1 5 7 8 9 10 11Exon

domain NLS

175

213 245

273

248

282

62

Fr eq

ue nc

y (%

)

3 4

12

10

8

6

4

2

0

1 63 64 92 102 306 316 325 355 364 393Codon

A

B

Fig. 1 TP53 mutation spectrum in primary colorectal cancers. a Proportion of TP53 mutation types, plotted as the percentage of the total number of detected mutations (N= 252). b Frequency and distribution of TP53 splice (red) and exon (gray) mutations at each position along the gene (codon number is indicated below). The frequency is calculated relative to all mutations included in this plot (N= 236; exon mutations encompassing multiple codons, i.e. indels, are excluded). Frequency of splice mutations represents all mutations affecting the relevant splice region; see Table 1 for details. TAD transactivation domain, PRD proline-rich domain, DBD DNA-binding domain, NLS nuclear localization domain, OD oligomerization domain, Neg negative-regulation domain

Smeby et al. Oncogenesis (2019) 8:35 Page 3 of 8

Oncogenesis

Ta b le

1 C h ar ac te ri st ic s of

TP 53

sp lic e m ut at io n s an

d co

rr es p on

d in g tr an

sc ri p t va

ri an

ts

Sa m p le

no .a

TP 53

sp lic e m ut at io n

c_ d es cr ip ti on

b A b er ra nt

tr an

sc ri p t va

ri an

t

(s )

D is tu rb ed

re ad

in g fr am

e Pr em

at ur e st op

co d on

Pr ed

ic te d p ro te in

p ro du

ct

1 Ex on

3+ 1 (S D ), G > A

c. 96 + 1 G > A

Ex on

3 sk ip pi ng

Ye s

Ye s

Tr un

ca te d

2 Ex on

5– 1 (S A ), G > A

c. 37 6– 1 G > A

Cr yp tic

SA ex on

5 N o

N o

Lo ss

of 7 A A s 5′

ex on

5

3 Ex on

7+ 1 (S D ), G > A

c. 78 2+

1 G > A

In tr on

7 re te nt io n

Ye s

Ye s

Tr un

ca te d

4 Ex on

5– 1 (S A ), G > A

c. 37 6– 1 G > A

Cr yp tic

SA ex on

5 N o

N o

Lo ss

of 7 A A s 5′

ex on

5

5 Ex on

9– 2 (S A ), A > G

c. 92 0– 2 A > G

(I) In tr on

8 re te nt io n

(II ) Ex on

9 sk ip pi ng

(I) N o

(II ) Ye s

(I) Ye s

(II ) Ye s

(I) Tr un

ca te d

(II ) Tr un

ca te d

6 Ex on

8+ 1 (S D ), G > A

c. 91 9+

1 G > A

(I) In tr on

8 re te nt io n

(II ) Ex on

8 sk ip pi ng

(I) N o

(II ) Ye s

(I) Ye s

(II ) Ye s

(I) Tr un

ca te d

(II ) Tr un

ca te d

7 Ex on

5– 1 (S A ), G > T

c. 37 6– 1 G > T

Cr yp tic

SA ex on

5 N o

N o

Lo ss

of 7 A A s 5′

ex on

5

8 Ex on

8– 1 (S A ), G > T

c. 78 3– 1 G > T

(I) In tr on

7 re te nt io n

(II ) C ry pt ic SA

ex on

8

(I) Ye s

(II ) Ye s

(I) Ye s

(II ) N o

(I) Tr un

ca te d

(II )L os s of

9 A A 5′ ex on

8, di st ur be

d re ad in g fra m e

9 Ex on

4+ 5, G > A

c. 37 5+

5 G > A

In tr on

4 re te nt io n

N o

Ye s

Tr un

ca te d

10 Ex on

7– 2 (S A ), A > T

c. 67 3– 2 A > T

(I) In tr on

6 re te nt io n

N o

Ye s

Tr un

ca te d

(II ) C ry pt ic SA

in tr on

6 Ye s

Ye s

Tr un

ca te d

11 Ex on

5+ 1 (S D ), G > A

c. 55 9+

1 G > A

(I) In tr on

5 re te nt io n

(II ) C ry pt ic SD

ex on

5

(I) Ye s

(II ) Ye s

(I) Ye s

(II ) Ye s

(I) Tr un

ca te d

(II ) Tr un

ca te d

12 Ex on

6– 1 (S A ), G > A

c. 56 0– 1 G > A

(I) In tr on

5 re te nt io n

(II ) Ex on

6 sk ip pi ng

(I) Ye s

(II ) Ye s

(I) Ye s

(II ) Ye s

(I) Tr un

ca te d

(II ) Tr un

ca te d

13 Ex on

5+ 1 (S D ), de

lG c. 55 9+

1d el 1

N ot

pr es en

t N /A

N /A

Fu ll le ng

th

14 Ex on

6– 9 (S A ), de

l1 4 bp

c. 56 0– 9_ 56 4d

el 14

N ot

pr es en

t N /A

N /A

Fu ll le ng

th

15 Ex on

7+ 3, in s 6 bp

c. 78 2+

3_ 78 2+

4i ns 6

N ot

pr es en

t N /A

N /A

Fu ll le ng

th

16 Ex on

8– 2 (S A ), A > G

c. 78 3– 2 A > G

N ot

pr es en

t N /A

N /A

Fu ll le ng

th

SD co ns en

su s sp lic e do

no r si te ,S A co ns en

su s sp lic e ac ce pt or

si te ,A

A am

in o ac id

a S am

pl es

or de

re d ac co rd in g to

es tim

at ed

re la tiv

e ex pr es si on

le ve l of

ca no

ni ca l tr an

sc rip

ts ,a

s sh ow

n in

Fi g.

2a b M ut at io ns

ar e de

sc rib

ed ac co rd in g to

th e H um

an G en

om e Va

ria tio

n So

ci et y (H G VS

) no

m en

cl at ur e an

d us in g th e N M _0

00 54

6. 4 co di ng

se qu

en ce

as re fe re nc e

Smeby et al. Oncogenesis (2019) 8:35 Page 4 of 8

Oncogenesis

1 0

50

100 TP 53

tr an

sc rip

t v ar

ia nt

s ( %

)

2 3 4 8 105 6 7 9 11 12 13 14 15 16

Sample no.

Exon skipping

Canonical

in frame

61

36

19

135

0

RP KM

112

0

RP KM

Sample 10

Normal sample

6noxE7noxE Intron 6 Cryptic splice acceptor site

4

11

1

1111

mRNA

Cryptic splice acceptor site c.673-2A>T

4

11

1

1111

mRNA

Intron retention

Canonical p53

4

11

1 1111

11

11

4

11

1 1111

DNA

mRNA

TAD PRD DBD OD NegNLSProtein

A

B

C

Sample 8

340

0

RP KM

Cryptic splice acceptor site

104

642

Sample 16

728

0

RP KM

Normal sample

112

0

RP KM

7noxE8noxE Intron 7 20

128

4 111 1111mRNA

Cryptic splice acceptor site

4 111 11mRNA

Intron retention c.783-1G>T

Fig. 2 (See legend on next page.)

Smeby et al. Oncogenesis (2019) 8:35 Page 5 of 8

Oncogenesis

Intriguingly, TP53 splice mutations were associated with a poor prognosis in stage II in our patient series. However, the low number of splice mutated samples within each stage and the lack of a detrimental effect of splice mutations across stages warrants cautious inter- pretation. Still, TP53 splice mutations have previously been shown to be enriched in metastatic CRC, an asso- ciation not seen for TP53 mutations in general7. This could conceivably be explained by the generation of novel transcript variants with distinct biological properties beyond their direct effects on the TP53 network, and a propensity for relapse and metastasis, as described for p53ψ and other truncating mutations in exon 67. Our finding of stage-specific inferior prognosis for TP53 splice mutations compared with other truncating mutations supports the notion of these mutations having biological consequences beyond the loss of wild-type TP53 activity. Studies on non-small cell lung cancer have shown that mutations in the splice acceptor site of intron 6 generate the prometastatic isoform p53ψ, conveying poor prog- nostic properties. The EMT-inducing properties of this isoform can be inhibited through targeting its down- stream effector Cyclophilin D by drugs such as cyclos- porine A8. To our knowledge, this is the first study to detect the presence and underlying genesis of this potentially prognostic and druggable transcript variant in CRC. We cannot rule out distinct oncogenic properties of the other splice mutations detected in this study. The lack of RNA sequencing data and TP53 transcript

variant analysis from tumors with nonsplice mutations is a potential limitation of our study. Pan-cancer analysis has shown that synonymous and missense exonic mutations can cause aberrant splicing of tumor suppressor genes in general and most frequently in TP53. This occurs most commonly through mutations at residues adjacent to splice junctions11,12. Our study cannot rule out such

splice-altering exonic mutations, but only two mutations in such residues were detected. This suggests only a very limited subset of splice-altering synonymous and mis- sense mutations leading to truncated TP53 protein with distinct impact on TP53 pathway activity and clinical outcome. In conclusion, splice mutations account for a 6% sub-

group of TP53-mutated CRCs, which are mutually exclusive to exon mutations, cause aberrant transcript variant expression, downregulation of downstream target gene expression and pathway activity, and may identify high-risk stage II patients.

Material and methods Patient material A total of 401 fresh-frozen primary CRCs from a

population-representative series of patients operated for CRC stage I−IV at Oslo University Hospital, Norway between 2005 and 2014 were included. DNA extraction was performed as previously described13,14. Comprehen- sive clinicopathological data were prospectively registered for all patients. The research conformed to the Helsinki Declaration and was approved by the Regional Committee for Medical and Health Research Ethics (REC number 1.2005.1629). Written consent was obtained from all patients. The research biobanks have been registered according to national legislation.

Mutation analyses, microsatellite instability status and CMS classification TP53 mutation status was assessed in all 401 samples

using Sanger sequencing of the entire coding region (exons 2–11), as well as the first ten and last ten nucleotides of each intron. Splice mutations were defined as any mutation affecting any of these intronic regions, based on the finding that intron mutations outside the

(see figure on previous page) Fig. 2 Transcript variants in TP53 splice mutated colorectal cancers. a Estimated relative expression levels of TP53 transcript variants in 16 splice mutated samples. Samples are ordered according to the relative expression level of the canonical splicing variant. Canonical splicing, cryptic splice sites and exon skipping events were quantified in the Sashimi plots, while intron retention values are the median depth of the relevant intronic region as measured by IRFinder. b Sashimi plots from two tumor samples with point mutation in the canonical splice acceptor site of intron 7 (marked with a dashed line) compared with a normal sample. Reads spanning exon junctions are represented by arcs, and each arc is labeled with the number of supporting reads. The arc representing an aberrant splicing event is colored in orange. Heights of bars reflect the read depth at each genomic position (reading frame right to left). Schematic visualization of the canonical TP53 transcript variant is shown in the top panel, and the two aberrant variants caused by the splice site mutation below, with coding sequences in light gray and noncoding sequences in black. In sample 8, 20 junction reads span transcripts using a cryptic acceptor site located 24 basepairs into exon 8, while 106 reads retain intron 7 (median depth of intron 7 as measured by IRFinder). Contrastingly, in both sample 16 and the normal sample all junction reads between exon 7 and exon 8 span the canonical splice sites. The transcript variant retaining intron 7 is predicted to generate a premature stop codon. The usage of an alternative splice acceptor site will lead to loss of nine amino acids in the 5′ part of exon 8 followed by disturbed reading frame but no generation of a premature stop codon. c Sashimi plot visualizing aberrant splicing variants in a sample with a TP53 mutation in the splice acceptor site of intron 6 compared with a nonmatched normal colonic mucosa sample. The transcript variant using a cryptic splice acceptor site located 49 basepairs upstream of the canonical splice site is identical with the p53ψ isoform, which is truncated due to the introduction of a premature stop codon. The aberrant transcript due to intron retention contains a premature stop codon in intron 6. Sashimi plots for the remaining samples with TP53 splice mutations are shown in Supplementary Fig. 1

Smeby et al. Oncogenesis (2019) 8:35 Page 6 of 8

Oncogenesis

invariant AG and GT dinucleotides in the consensus splice acceptor (SA) and splice donor (SD) site, respec- tively, may lead to missplicing15. Tumors harboring only synonymous mutations were classified as TP53 wild-type (wt) in survival analysis. Microsatellite instability (MSI) analysis, consensus molecular subtype (CMS) classifica- tion and mutation analyses for KRAS (exon 2: codons 12

and 13) and BRAF (codon 600) were performed in all samples as previously described16–19. KRAS exon 3 codon 61 was analyzed in a subset of samples (N= 127).

Differential gene expression and gene set enrichment analysis All samples have previously been analyzed for gene

expression at the exon-level using Affymetrix GeneChip® Human Exon 1.0 ST Array (HuEx, N= 199)13,20 and are available from GEO (GSE24550, GSE29638, GSE69182, GSE79959), or Human Transcriptome Array 2.0 (HTA 2.0, N= 202) (GSE79959)21 (GSE9652)18. Differential gene expression analysis and single-sample gene set enrichment analysis were performed using the R packages limma22 and GSVA23, respectively. The “HALL- MARK_P53_PATHWAY” gene set from the Molecular Signatures Database (v5.2)10 was used to assess differ- ential TP53 pathway activity among mutation groups. Dunn’s test implemented in the R package dunn.test was used for statistical significance testing. This is a non- parametric post hoc test after Kruskal−Wallis rank sum tests to identify medians that are significantly different between the possible pairs.

RNA sequencing, alignment and transcript variant analysis RNA sequencing was performed for all samples har-

boring TP53 splice mutations (N= 16) detected by Sanger sequencing, and one normal sample for comparison. Strand-specific libraries for TruSeq total RNA sequencing (Illumina Inc.) were prepared according to protocol. Sequencing was performed on the Illumina HiSeq 2500 platform (2 × 101 bp, paired end) to an average of 77.4 million read pairs per sample. FastQC was run for quality control, and reads were aligned to the hg38 (HGMF genome_snp_tran) reference using HiSat2 v. 2.0.4 24. Aligned reads were inspected for transcript variants of TP53 using the Integrative Genome Viewer (IGV). The discovered transcript variants were visualized with the IGV built-in Sashimi plot function25. Exon skipping, usage of cryptic splice sites and canonical splicing in the relevant TP53 splice regions were detected and quantified by the Sashimi plot estimates, further described in legend to Fig. 2b. To avoid false positives due to sequencing or alignment artifacts, only aberrant transcript variants detected in ≥5 reads and accounting for ≥5% of all detected relevant reads in the sample are considered. As intron retention is not quantified by the Sashimi plots, we used the IRFinder algorithm to detect and quantify this class of splice variants26. Only events detected by the IRFinder algorithm with default settings were con- sidered26. The threshold for calling intron retention was set at a minimum of 10% of transcripts retaining the intron (IR ratio > 0.1) and with a minimum coverage of three reads for each base pair of the intron, after

Fig. 3 Transcriptional consequences and prognostic associations of TP53 splice mutations. a TP53 gene expression levels according to mutation type. Due to small sample sizes, in frame indels and silent mutations were not included. b TP53 signature score according to mutation type. Due to small sample sizes, in frame indels and silent mutations were not included. ssGSEA single-sample gene set enrichment analysis. c Kaplan−Meier survival curve showing 5-year relapse-free survival (RFS) for patients with TP53 wt, exon mutations and splice mutations in CRC stage II. d 5-year RFS according to TP53 mutation type in CRC stage II. Due to small sample size, in frame indels and silent mutations (N= 1) were not included

Smeby et al. Oncogenesis (2019) 8:35 Page 7 of 8

Oncogenesis

excluding nonmeasurable intronic regions, as recom- mended by the authors. The expression levels for intron retention events are reported as the median depth of the relevant intronic region, given as output by IRFinder. Intron 4 of TP53 could not be evaluated by IRFinder, due to exclusion of low complexity regions or overlapping feature annotations preventing unique mapping of reads. For samples with splice mutations affecting this region, evidence of intron retention was visually analyzed in the Sashimi plots and quantified by the Bedtools coverageBed function27.

Statistical analyses Statistical analyses were performed using the SPSS

21.0 software (SPSS Inc.). Fisher’s exact or Spearman correlation tests were applied when appropriate to eval- uate associations between categorical variables. Five-year overall survival (OS) and 5-year relapse-free survival (RFS) plots were generated by the Kaplan–Meier method and the log-rank test was used to compare plots. The Cox proportional hazards model was used for univariable and multivariable analyses to assess the independence of prognostic factors. RFS and OS were calculated with date of surgery as starting point. All tests are two-tailed and P values < 0.05 were considered significant.

Acknowledgements This work was supported by the Norwegian Cancer Society (project numbers 6824048-2016 to A.S. and 182759-2016 to R.A.L.); the foundation Stiftelsen Kristian Gerhard Jebsen; and the Research Council of Norway (FRIPRO Toppforsk, project number 250993 to R.A.L.).

Author details 1Department of Molecular Oncology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway. 2K.G. Jebsen Colorectal Cancer Research Centre, Division of Cancer Medicine, Oslo University Hospital, Oslo, Norway. 3Department of Oncology, Oslo University Hospital, Oslo, Norway. 4Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Oslo, Norway. 5Department of Gastroenterological Surgery, Oslo University Hospital, Oslo, Norway

Authors’ contributions Conception and design: J.S., A.S. and R.A.L. Development of methodology: A.S, A.M.H., R.I.S. and R.A.L. Acquisition of data: J.S., I.A.E., S.A.D., M.H., M.G.G. and A.N. Analysis and interpretation of data: J.S., A.S., I.A.E., S.A.D., A.M.H., P.W.E., B.J., A.N. and R.A.L. Writing of the manuscript: J.S., A.S., M.G.G. and R.A.L. Review and/or revision of the manuscript: all authors. Study supervision: A.S. and R.A.L.

Conflict of interest The authors declare that they have no conflict of interest.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information accompanies this paper at (https://doi.org/ 10.1038/s41389-019-0141-3).

Received: 1 November 2018 Revised: 17 March 2019 Accepted: 5 April 2019

References 1. Fearon, E. R. & Vogelstein, B. A genetic model for colorectal tumorigenesis. Cell

61, 759–767 (1990). 2. Lothe, R. A. et al. Molecular genetic studies of tumor suppressor gene regions

on chromosomes 13 and 17 in colorectal tumors. J. Natl. Cancer Inst. 84, 1100–1108 (1992).

3. TCGA. Comprehensive molecular characterization of human colon and rectal cancer. Nature 487, 330–337 (2012).

4. Bouaoun, L. et al. TP53 variations in human cancers: new lessons from the IARC TP53 database and genomics data. Hum. Mutat. 37, 865–876 (2016).

5. Leroy, B., Anderson, M. & Soussi, T. TP53 mutations in human cancer: database reassessment and prospects for the next decade. Hum. Mutat. 35, 672–688 (2014).

6. Holmila, R., Fouquet, C., Cadranel, J., Zalcman, G. & Soussi, T. Splice mutations in the p53 gene: case report and review of the literature. Hum. Mutat. 21, 101–102 (2003).

7. Shirole, N. H. et al. TP53 exon-6 truncating mutations produce separation of function isoforms with pro-tumorigenic functions. Elife 5, e17929 (2016).

8. Senturk, S. et al. p53Psi is a transcriptionally inactive p53 isoform able to reprogram cells toward a metastatic-like state. Proc. Natl Acad. Sci. USA 111, E3287–E3296 (2014).

9. Fischer, M. Census and evaluation of p53 target genes. Oncogene 36, 3943–3956 (2017).

10. Liberzon, A. et al. Molecular signatures database (MSigDB) 3.0. Bioinformatics 27, 1739–1740 (2011).

11. Jung, H. et al. Intron retention is a widespread mechanism of tumor- suppressor inactivation. Nat. Genet. 47, 1242–1248 (2015).

12. Giacomelli, A. O. et al. Mutational processes shape the landscape of TP53 mutations in human cancer. Nat. Genet. 50, 1381–1387 (2018).

13. Agesen, T. H. et al. ColoGuideEx: a robust gene classifier specific for stage II colorectal cancer prognosis. Gut 61, 1560–1567 (2012).

14. Berg, M. et al. Distinct high resolution genome profiles of early onset and late onset colorectal cancer integrated with gene expression data identify candi- date susceptibility loci. Mol. Cancer 9, 100 (2010).

15. Anderson, S. L. et al. Familial dysautonomia is caused by mutations of the IKAP gene. Am. J. Hum. Genet. 68, 753–758 (2001).

16. Dienstmann, R. et al. Prediction of overall survival in stage II and III colon cancer beyond TNM system: a retrospective, pooled biomarker study. Ann. Oncol. 28, 1023–1031 (2017).

17. Vedeld, H. M. et al. CpG island methylator phenotype identifies high risk patients among microsatellite stable BRAF mutated colorectal cancers. Int J. Cancer 141, 967–976 (2017).

18. Sveen, A. et al. Colorectal cancer consensus molecular subtypes translated to preclinical models uncover potentially targetable cancer cell dependencies. Clin. Cancer Res. 24, 794–806 (2018).

19. Smeby, J. et al. CMS-dependent prognostic impact of KRAS and BRAFV600E mutations in primary colorectal cancer. Ann. Oncol. 29, 1227–1234 (2018).

20. Sveen, A. et al. ColoGuidePro: a prognostic 7-gene expression signature for stage III colorectal cancer patients. Clin. Cancer Res. 18, 6001–6010 (2012).

21. Sveen, A. et al. Multilevel genomics of colorectal cancers with microsatellite instability-clinical impact of JAK1 mutations and consensus molecular subtype 1. Genome Med. 9, 46 (2017).

22. Ritchie, M. E. et al. Limma powers differential expression analyses for RNA- sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015).

23. Hanzelmann, S., Castelo, R. & Guinney, J. GSVA: gene set variation analysis for microarray and RNA-Seq data. Bmc Bioinforma. 14, 7 (2013).

24. Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).

25. Katz, Y. et al. Quantitative visualization of alternative exon expression from RNA-seq data. Bioinformatics 31, 2400–2402 (2015).

26. Middleton, R. et al. IRFinder: assessing the impact of intron retention on mammalian gene expression. Genome Biol. 18, 51 (2017).

27. Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).

Smeby et al. Oncogenesis (2019) 8:35 Page 8 of 8

Oncogenesis

  • Transcriptional and functional consequences of TP53 splice mutations in colorectal cancer
    • Introduction
    • Results
      • The spectrum of coding and splice mutations in TP53
      • Diverse aberrant transcript variants caused by TP53 splice mutations
      • TP53 splice mutations lead to reduced expression of TP53 target genes
    • Discussion
    • Material and methods
      • Patient material
      • Mutation analyses, microsatellite instability status and CMS classification
      • Differential gene expression and gene set enrichment analysis
      • RNA sequencing, alignment and transcript variant analysis
      • Statistical analyses
    • ACKNOWLEDGMENTS