Long intergenic non-coding RNA (lincRNA) are transcripts longer than 200 nucleotides which have a diverse sets of regulatory functions but do not get translated into protein. lincRNAs are located between the protein coding genes and do not overlap exons of either protein-coding or other non-lincRNA. However precise role of individual lincRNA in disease biology remains unclear. Here, we have evaluated the lincRNA expression and their potential biological functions in MM.
We performed RNA-seq on CD138+ MM cells from 296 newly diagnosed patients and 16 normal bone marrow plasma cells (NBM) and analyzed for lincRNA expression. Data from paired-end RNAseq reads were mapped to the latest human genome, differentially expressed lincRNAs were identified and for each expressed lincRNA event free survival was examined with univariate cox regression model and support vector machine. Finally, we identified protein coding genes that are strongly correlated (cor > 0.5) with lincRNAs with significant altered expression in MM and impact on EFS to identify their biological role.
lincRNA and protein coding genes that have more than 10 reads/million reads for at least 15 normal samples or 62 MM samples (20% all MM samples) were included in the analysis. We identified 60 differentially expressed lincRNA (adj p value <0.05), 51 of those had at least 1.5 fold change difference. The differentially expressed lncRNAs were in close proximity of Ig-related genes, genome stability related genes, hosting miRNAs such as mir222 and mir22 and previously reported for other cancers (PVT and TTY15). We evaluated relation of these lincRNAs with event free survival (EFS) and observed 6 lincRNAs associated with shorter EFS. We have developed multivariate signature model to predict EFS by using these 6 lincRNAs. We divided our dataset into training (n=99) and test (n=156) dataset and we utilized support vector machine classification to divide samples into 2 groups using six lincRNAs. This model was able to predict good and poor survival groups in training dataset (p val < 0.001) as well as test dataset (p val = 0.002) (Figure).
We examined genome wide correlation between these six differentially expressed and prognostically significant lincRNAs to expressed protein coding genes to identify their biological functions in MM. Four of these lincRNAs strongly correlated with 47 to 504 genes (abs(cor) > 0.5), affecting immune system pathways and pathways in cancer including Jak-STAT signaling pathway. We also found that these lincRNAs are also highly correlated with tumor development genes such as TNFRSF1B,FGR,TP53BP2,TNF and T or B cells related genes PIK3CD, BCL6. In addition, two of these lincRNAs (LINC00936 and CTB-61M7.2) were found highly correlated with their protein coding neighbor genes ATP2B1(cor = 0.45) and FCAR (cor = 0.95) respectively and MIR22HG was host gene for mir22 which may indicate lincRNAs are using different machinery in MM to regulate protein coding genes.
In summary, we report that lincRNA is differentially expressed and prognostically significant in myeloma and may function through their impact on immune system and tumor progression. Our ongoing integrative approach will provide further evidence of their regulatory role in MM with potential therapeutic application.
Anderson:acetylon pharmaceuticals: Equity Ownership; Celgene Corporation: Consultancy; Gilead: Consultancy; Oncocorp: Equity Ownership; Millennium: Consultancy; BMS: Consultancy. Munshi:onyx: Membership on an entity's Board of Directors or advisory committees; celgene: Membership on an entity's Board of Directors or advisory committees; novartis: Membership on an entity's Board of Directors or advisory committees; millenium: Membership on an entity's Board of Directors or advisory committees.
Asterisk with author names denotes non-ASH members.