BUSpaRse

kallisto | bustools R utilities


Bioconductor version: Release (3.20)

The kallisto | bustools pipeline is a fast and modular set of tools to convert single cell RNA-seq reads in fastq files into gene count or transcript compatibility counts (TCC) matrices for downstream analysis. Central to this pipeline is the barcode, UMI, and set (BUS) file format. This package serves the following purposes: First, this package allows users to manipulate BUS format files as data frames in R and then convert them into gene count or TCC matrices. Furthermore, since R and Rcpp code is easier to handle than pure C++ code, users are encouraged to tweak the source code of this package to experiment with new uses of BUS format and different ways to convert the BUS file into gene count matrix. Second, this package can conveniently generate files required to generate gene count matrices for spliced and unspliced transcripts for RNA velocity. Here biotypes can be filtered and scaffolds and haplotypes can be removed, and the filtered transcriptome can be extracted and written to disk. Third, this package implements utility functions to get transcripts and associated genes required to convert BUS files to gene count matrices, to write the transcript to gene information in the format required by bustools, and to read output of bustools into R as sparses matrices.

Author: Lambda Moses [aut, cre] , Lior Pachter [aut, ths]

Maintainer: Lambda Moses <dlu2 at caltech.edu>

Citation (from within R, enter citation("BUSpaRse")):

Installation

To install this package, start R (version "4.4") and enter:


if (!require("BiocManager", quietly = TRUE))
    install.packages("BiocManager")

BiocManager::install("BUSpaRse")

For older versions of R, please refer to the appropriate Bioconductor release.

Documentation

To view documentation for the version of this package installed in your system, start R and enter:

browseVignettes("BUSpaRse")
Converting BUS format into sparse matrix HTML R Script
Transcript to gene HTML R Script
Reference Manual PDF
NEWS Text
LICENSE Text

Details

biocViews RNASeq, SingleCell, Software, WorkflowStep
Version 1.20.0
In Bioconductor since BioC 3.10 (R-3.6) (5 years)
License BSD_2_clause + file LICENSE
Depends R (>= 3.6)
Imports AnnotationDbi, AnnotationFilter, biomaRt, BiocGenerics, Biostrings, BSgenome, dplyr, ensembldb, GenomeInfoDb, GenomicFeatures, GenomicRanges, ggplot2, IRanges, magrittr, Matrix, methods, plyranges, Rcpp, S4Vectors, stats, stringr, tibble, tidyr, utils, zeallot
System Requirements GNU make
URL https://github.com/BUStools/BUSpaRse
Bug Reports https://github.com/BUStools/BUSpaRse/issues
See More
Suggests knitr, rmarkdown, testthat, BiocStyle, txdbmaker, TENxBUSData, TxDb.Hsapiens.UCSC.hg38.knownGene, txdbmaker, BSgenome.Hsapiens.UCSC.hg38, EnsDb.Hsapiens.v86
Linking To Rcpp, RcppArmadillo, RcppProgress, BH
Enhances
Depends On Me
Imports Me
Suggests Me
Links To Me
Build Report Build Report

Package Archives

Follow Installation instructions to use this package in your R session.

Source Package BUSpaRse_1.20.0.tar.gz
Windows Binary (x86_64) BUSpaRse_1.20.0.zip (64-bit only)
macOS Binary (x86_64) BUSpaRse_1.20.0.tgz
macOS Binary (arm64) BUSpaRse_1.20.0.tgz
Source Repository git clone https://git.bioconductor.org/packages/BUSpaRse
Source Repository (Developer Access) git clone git@git.bioconductor.org:packages/BUSpaRse
Bioc Package Browser https://code.bioconductor.org/browse/BUSpaRse/
Package Short Url https://bioconductor.org/packages/BUSpaRse/
Package Downloads Report Download Stats