MEVN stack experiment with Jupyter
An experiment to see how far I can go developing an application using only a Jupyter notebook.
Using the Python blockdiag library you can easily create block diagrams, and with Jupyter's nbconvert you can turn the notebook into a presentation.
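A minimal sketch of that workflow, assuming the blockdiag package and its CLI are installed; the diagram definition and file names are illustrative:
import subprocess

# Write a blockdiag definition to disk (the diagram itself is illustrative)
diagram = """
blockdiag {
  browser -> webserver -> database;
  webserver -> cache;
}
"""
with open("pipeline.diag", "w") as f:
    f.write(diagram)

# Render the definition to pipeline.png with the blockdiag CLI
subprocess.run(["blockdiag", "pipeline.diag"], check=True)

# The notebook itself can then be turned into reveal.js slides:
#   jupyter nbconvert --to slides notebook.ipynb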
Create a lagged column in a PySpark DataFrame:
from pyspark.sql.functions import monotonically_increasing_id, lag
from pyspark.sql.window import Window
# Add an ID to be used by the window function (the IDs are
# monotonically increasing, but not necessarily consecutive)
df = df.withColumn('id', monotonically_increasing_id())
# Set the window; ordering without a partitionBy pulls all rows
# into a single partition, so use this on small data only
w = Window.orderBy("id")
# Create the lagged value
value_lag = lag('value').over(w)
# Add the lagged values to a new column
df = df.withColumn('prev_value', value_lag)
This tutorial will go through creating an application using the MEVN stack (MongoDB, Express, Vue.js, Node.js).
A simple trick to select columns from a DataFrame:
# Columns to keep (the list here is illustrative)
DESIRED_COLUMNS = ["id", "value"]
# Create the filter condition: true for columns we do not want
condition = lambda col: col not in DESIRED_COLUMNS
# Drop everything that is not a desired column
filtered_df = df.drop(*filter(condition, df.columns))
Another experiment: using Vue.js, the progressive JavaScript framework, in a Jupyter notebook.
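A minimal sketch of the idea, assuming Vue 2 can be loaded from a CDN inside the notebook; the app and message are illustrative:
from IPython.display import HTML

# Render a small Vue app in the cell output: load Vue from a CDN
# and mount it on a div in the output area
HTML("""
<div id="app">{{ message }}</div>
<script src="https://cdn.jsdelivr.net/npm/vue@2"></script>
<script>
  new Vue({ el: "#app", data: { message: "Hello from Vue in a notebook!" } });
</script>
""")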
In my previous posts I have already shown simple examples of using MapReduce and Spark with PySpark. A missing piece when moving from MapReduce to Spark is the use of Pig scripts. This post shows an example of how to use a Pig script.
This is a short explanation of how to set up a Truffle decentralized app using Docker containers.
Last time I started to experiment with Hadoop and simple scripts using MapReduce and Pig on a Cloudera Docker container. Now let's start playing with Spark, since it is the go-to framework for machine learning on Hadoop.
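As a first step, a minimal sketch that starts a Spark session and builds a toy DataFrame; the app name and data are illustrative:
from pyspark.sql import SparkSession

# Start (or reuse) a Spark session; the app name is illustrative
spark = SparkSession.builder.appName("spark-playground").getOrCreate()

# Build a toy DataFrame and inspect it
df = spark.createDataFrame([(1, "a"), (2, "b")], ["id", "value"])
df.show()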
This post describes my first experiment with the Cloudera environment, trying out basic MapReduce on a simple dataset.
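For a flavour of what that looks like, a minimal word-count sketch in the Hadoop Streaming style, where the mapper and reducer are plain Python scripts reading stdin; all file names and paths are illustrative:
# mapper.py -- emit a (word, 1) pair for every word on stdin
import sys

for line in sys.stdin:
    for word in line.strip().split():
        print(word + "\t1")

# reducer.py -- sum the counts per word (Hadoop sorts mapper output
# by key, so identical words arrive consecutively)
import sys

current_word, count = None, 0
for line in sys.stdin:
    word, value = line.strip().split("\t")
    if word != current_word:
        if current_word is not None:
            print(current_word + "\t" + str(count))
        current_word, count = word, 0
    count += int(value)
if current_word is not None:
    print(current_word + "\t" + str(count))

# Submitted via the streaming jar (path illustrative):
#   hadoop jar hadoop-streaming.jar -input in -output out \
#     -mapper mapper.py -reducer reducer.py -file mapper.py -file reducer.py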