About
Contact
Work
Flint

CastFlyer


🖥

Hello, I'm Camilo Valdes, Ph.D., a computer scientist, researcher, and developer. This is where I write about my research and work.

Featured
Aug 9, 2021
Jasper, Microbiome Map, DNA Sequencing, macOS
Microbiome Maps

Microbiome Maps are visualizations of microbial community profiles that you can create with Jasper, a tool for building rich, interactive maps of your metagenomic samples. Jasper uses a Hilbert curve to place genomes on an interactive canvas that can display thousands of genomes at once.
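
To make the Hilbert curve idea concrete, here is a minimal sketch of the standard index-to-coordinate mapping for a Hilbert curve on a square grid. It is not Jasper's actual code: the function names `hilbert_d2xy` and `place_on_canvas`, the genome labels, and the grid size are all assumptions chosen for illustration.

```python
def hilbert_d2xy(order: int, d: int) -> tuple[int, int]:
    """Map position d along a Hilbert curve to (x, y) on a 2**order x 2**order grid."""
    n = 1 << order          # side length of the grid
    x = y = 0
    t = d
    s = 1
    while s < n:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        # Rotate or flip the quadrant so the sub-curves join end to end.
        if ry == 0:
            if rx == 1:
                x = s - 1 - x
                y = s - 1 - y
            x, y = y, x
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y


def place_on_canvas(genomes: list[str], order: int) -> dict[str, tuple[int, int]]:
    """Assign each genome, in list order, to the next cell along the curve (hypothetical helper)."""
    if len(genomes) > 4 ** order:
        raise ValueError("grid is too small for this many genomes")
    return {name: hilbert_d2xy(order, i) for i, name in enumerate(genomes)}


if __name__ == "__main__":
    # Hypothetical genome labels; a real map would use an ordered list of reference genomes.
    print(place_on_canvas(["genome_a", "genome_b", "genome_c", "genome_d"], order=1))
```

The useful property is locality: items that are neighbors in the input list land in adjacent cells, so related genomes placed next to each other in the ordering stay close together on the canvas.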

Jul 8, 2019
Paper: Large Scale Microbiome Profiling in the Cloud

The paper for Flint just got published! You can view the publication at Oxford Bioinformatics. Flint is a metagenomics profiling pipeline that is built on top of the Apache Spark framework, and is designed for fast real-time profiling of metagenomic samples against a large collection of reference genomes.

May 15, 2019
Paper Accepted at ISMB 2019

Our paper, Large Scale Microbiome Profiling in the Cloud, got accepted for a Proceedings Presentation at the 2019 joint Intelligent Systems for Molecular Biology and European Conference on Computational Biology (ISMB/ECCB) in Basel, Switzerland!

Mar 25, 2019
PhD
PhD Proposal Defense

I recently defended my PhD proposal at the CS department at Florida International University (FIU). I’m currently working on the presentation and the talk is scheduled for April.

Jan 16, 2018
Spark, virtual machine
Properly Shutting Down a VirtualBox Virtual Machine

We’ve been testing some Spark code that will eventually be moved to AWS. For now, to save costs, we’ve created an 8-node Spark cluster that runs on a set of virtual machines running Ubuntu on VirtualBox. We’ve developed some Bash scripts to make starting (and shutting down) the VMs easy.
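
As a rough illustration of what a clean shutdown can look like, the sketch below builds one `VBoxManage controlvm <name> acpipowerbutton` command per VM, which asks the guest to power off through ACPI instead of cutting power. It is written in Python rather than Bash and is not the scripts from the post: the VM names `spark-node-1` through `spark-node-8` are hypothetical, and it assumes the Ubuntu guests respond to ACPI power button events.

```python
import subprocess


def shutdown_command(vm_name: str) -> list[str]:
    """Build the VBoxManage call that sends an ACPI power button event to one VM."""
    return ["VBoxManage", "controlvm", vm_name, "acpipowerbutton"]


def shutdown_cluster(vm_names: list[str], dry_run: bool = True) -> list[list[str]]:
    """Return the shutdown commands for every VM; run them only when dry_run is False."""
    commands = [shutdown_command(name) for name in vm_names]
    if not dry_run:
        for command in commands:
            # check=True raises CalledProcessError if VBoxManage reports a failure.
            subprocess.run(command, check=True)
    return commands


if __name__ == "__main__":
    # Hypothetical names for the 8 nodes; substitute the names shown by `VBoxManage list vms`.
    nodes = [f"spark-node-{i}" for i in range(1, 9)]
    for command in shutdown_cluster(nodes, dry_run=True):
        print(" ".join(command))
```

The dry run is the default so the commands can be inspected before anything is powered off.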

Apr 14, 2017
Computers and Intractability Book

Got a copy of a great book, Computers and Intractability: A Guide to the Theory of NP-Completeness, from Bell Labs.

Oct 25, 2016
sequencing, dna
Genome Building

The Bioinformatics repository on my GitHub account contains a script I use to "build" the Human Genome: it creates the genomic data structures that I need to run a DNA sequencing analysis. The data structures are Burrows-Wheeler indices that the genomic aligners (Bowtie2) need to get their job done.
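
For a sense of what sits underneath a Burrows-Wheeler index, here is a minimal sketch of the transform itself on a toy sequence. It uses the naive sorted-rotations construction, which is only practical for short strings; it is not how Bowtie2 builds its indices and not the build script from the repository. The function names and the `$` sentinel are assumptions for illustration.

```python
def bwt(text: str, sentinel: str = "$") -> str:
    """Burrows-Wheeler transform via sorted rotations (naive, for short strings only)."""
    if sentinel in text:
        raise ValueError("text must not contain the sentinel character")
    s = text + sentinel
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    # The transform is the last column of the sorted rotation table.
    return "".join(rotation[-1] for rotation in rotations)


def inverse_bwt(transformed: str, sentinel: str = "$") -> str:
    """Recover the original text by repeatedly prepending the transform and sorting."""
    table = [""] * len(transformed)
    for _ in range(len(transformed)):
        table = sorted(transformed[i] + table[i] for i in range(len(transformed)))
    # The row ending in the sentinel is the original text followed by the sentinel.
    row = next(r for r in table if r.endswith(sentinel))
    return row[:-1]


if __name__ == "__main__":
    encoded = bwt("GATTACA")
    print(encoded)
    print(inverse_bwt(encoded))
```

The transform is reversible and tends to group identical characters together, which is what makes it a good basis for compact, searchable genome indices.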

Sep 20, 2016
Deep Learning
Deep Learning Videos

I found this great channel by Professor Nando de Freitas at the University of Oxford. Most of the videos are good, but the series on Neural Networks and Deep Learning is great.

Aug 18, 2016
R
Upgrading R

Recently I had to upgrade my R installation because I needed to install a library that required a newer version of R than the one I had installed. I used to live life on the edge and upgrade R as soon as a new version was available, but as my collection of third-party libraries grew, I upgraded R less and less often.

Jul 24, 2016
R, diagnostics
Visualization & Diagnostic Plots

I needed to create a series of diagnostic plots for a recent Data Mining project. I created the plots by hand using R; by "by hand" I mean that I wrote a script to generate them rather than using a tool such as Tableau. The reason is that the data for the plots came from the UCI Machine Learning Repository, and it just so happened that those particular datasets come bundled with the R standard library. :)



© 2020 Camilo Valdes

Coded locally in Miami, FL. 🐬