Tech Index

NLP: natural language processing
  1. The Art of Tokenization
  2. Part-of-Speech Lookup Tables and Frequency Distributions
  3. Jaccard Similarity
  4. Phonetic Algorithms
  5. TF/IDF with Google n-Grams and part-of-speech Tags
  6. Well, well, well ... Word Sense Disambiguation (WSD) with Google n-Grams
  7. A Visual Introduction to Phrase Structure and Dependency Parsing
  8. Same but not the Same: Homonym Frequency Distributions
  9. Auto-Generation of Regular Expressions for Spelling Errors
  10. Snowball Stemmer for Java
  11. SKOS Recipe for Stemming
  12. Language Modeling
  13. Natural Language Understanding of Unstructured Data
  14. Precision vs Recall in Dictionary Matching
  15. [PPT] Natural Language Classification on Bluemix
  16. Triple Extraction
    1. Straight-up Triples (1-6)
    2. Intransitive Action (2-6)
    3. Modeling Business Scenarios (3-6)
    4. Modeling Perception (4-6)
    5. Modeling Characteristics (5-6)
    6. Spatial Modeling (6-6)
    7. Extracting IS-A Relationships
  17. Alchemy API:
    1. Using the Taxonomy Tagger and Source Code

Provenance: a digital record
  1. W3C Provenance Standard (I was a member of the working group)
  2. W3C Provenance Model Primer
  3. PROV and Dublin Core
  4. Provenance and Compliance Risk Management
  5. Reification and Trust
    1. [PPT] Things and Strings
  6. [PPT] PROV-O Overview

Machine Learning: learning from data
  1. The Road to Gamma
  2. The Chain Rule of Probability
  3. Octave: A Simple Prototyping tool for Machine Learning

Graphs: entities and relationships
  1. [PPT] Ontologies and Semantic Web
  2. Inference using OWL 2.0 Semantics
  3. Semantic Relationships in Wordnet
  4. Distinguishing Parts from Kinds
  5. BGS Open Data Review:
    1. Geochronology Ontologies
    2. Earth Material Ontologies
  6. NLP-Driven Ontology modeling for Ontology-driven NLP
  7. Ontology-driven Natural Language Processing
  8. An Omonastic Compilation and Ontology
  9. Model Simplification through Dependency Reinjection
  10. Neo4j
    1. Neo4j and Python on Ubuntu (using VirtualBox)

Jena: a java system for rdf
  1. Jena: A Semantic Web Framework (Introduction)
  2. The Jena Schemagen Tool
  3. The Jena Layer Cake

Apache Hadoop: reliable, scalable cluster computing
  1. Software Installation
    1. Windows:
      1. Virtualbox for Virtualization
      2. Installing PuTTY
    2. Linux:
      1. Installing Java
      2. for Development Environments only:
        1. Installing Maven
        2. Installing Eclipse
        3. Develop your Environment
  2. Creating the Cluster
    1. Bring the Binaries
    2. Secure the Channel
    3. Configure the Base
    4. Cloning for Clusters
      1. Adding a new DataNode
  3. Using the Cluster
    1. Flip the Switch
    2. Copy some Files
    3. Submit a Job
    4. Monitor your Progress
  4. Understanding the Hadoop Architecture
  5. Troubleshooting
    1. Incompatible Cluster IDs
    2. GC Overhead Limit Exceeded

Apache Spark: lightning fast cluster computing
  1. Installing Spark on Ubuntu
  2. Spark RDD (Resilient Distributed Dataset) Introduction and API
  3. Deploy a Scala Job to a Spark Cluster using SBT
  4. PySpark Recipes

Scala Lang: OO and java interoptable lang
  1. Installing Scala on Ubuntu
  2. Installing sbt on Ubuntu

Python: the Python Programming Language
  1. Installation and Configuration:
    1. Configuring a Python Development Environment on Ubuntu
    2. Exposing a Python App via Django in Vagrant
  2. MongoDB
    1. Installing the Python Driver in MongoDB
    2. Importing Data into MongoDB using Python
  3. Requests Module
    1. GET and POST with Requests
    2. Posting documents to Solr 
  4. Zeppelin and Pyspark
    1. Writing to Parquet
    2. Transform a CSV to Parquet
    3. Merge Multiples CSVs in one Dataframe
    4. Finding Associated Hashtags (Contexual Analysis)
    5. Ad-hoc Social Analysis in Twitter  
    6. Pyspark Operations Overview
    7. Pyspark Recipes 
  5. Other
    1. Using the Slack RTM API
    2. Pandas and Geonames 
    3. Using Paramiko for SSH-based operations 
    4. Using Selenium to logon to Twitter 
    5. Bulk Inserts to MySQL 

Java: general-purpose computer programming language
  1. Installing the JDK on Ubuntu
  2. Servlets
    1. Servlet Redirecting and Dispatching
    2. Servlet Mapping
  3. Spring
    1. The Spring Container for IoC (Inversion of Control) 
    2. Loading Resources from the Local File System
    3. Data Access
      1. Using the JDBC  Template
      2. Spring JPA
  4. Jersey
    1. An Introduction to Java API for RESTful Web Services (JAX-RS)
    2. Deploying a RESTful Web Service onto Tomcat (running in a Docker Container)
    3. JAX-RS and MongoDB (using Docker)
  5. Public Contributions on Github:
    1. java-commons, jaxrs-minimalist, jaxrs-mongo, snowball (maven-compatible)

Docker: an open platform for distributed apps
  1. Installing Docker
    1. on Ubuntu
    2. on Mac OS
  2. Docker Design and Architecture
  3. Creating your own Dockerfile
  4. Docker and ...
    1. Apache Tomcat
    2. Apache Solr
  5. Public Contributions on Github

MySQL: a popular open-source database
  1. MySQL Installation on Ubuntu
  2. Using the MySQL Workbench
  3. Essential Admin Tasks

Maven: a build automation tool
  1. Installing Maven
  2. Creating a Java Project with Maven
  3. Using Profiles in Maven
  4. Project Object Model (POM) Overview
  5. Multi-Workspace Structuring Best Practices
  6. Installing a Local JAR
  7. Creating an Executable JAR
  8. Deploying a Web Project (without Eclipse)
  9. Dependency Visualization
  10. Parent POMs and the Maven Reactor
  11. Public Contributions on Github (BOMs)

Tomcat: an open-source web server and servlet container
  1. Deploying to Tomcat without an IDE using Maven
  2. Docker and Apache Tomcat

MongoDB: an open-source document database
  1. Installation
    1. Installation on Ubuntu
    2. MongoDB and Docker
  2. Changing the Data Path (dbPath)
  3. Installing the RoboMongo Administrative Client
  4. An Introduction to the Aggregation Pipeline
  5. Java
    1. Using the Java Driver
    2. Using Spring Data for MongoDB
    3. JAX-RS and MongoDB
  6. Python
    1. Installing the Python Driver
    2. Importing Data using PyMongo

Lucene: a high-performance search engine library
  1. Building the Search Index
  2. Querying the Search Index
  3. Query Expansion with Wordnet Synonyms

Solr: an enterprise search platform (built on Lucene)
  1. Introduction to Solr (on Docker)
  2. Posting documents to Solr
  3. Automated Tokenization testing with Solr

Bluemix: an open-standards cloud platform
  1. Configuring a Local Development Environment
  2. Docker-in-docker Dockerfile for Deploying Docker to Bluemix
  3. [PPT] Creating a Git Project (on Bluemix)
  4. [PPT] Building a Project with Maven (on Bluemix)
  5. [PPT] Deploying a Java Web App (on Bluemix)
  6. [PPT] Training the NL Classifier on Bluemix

Github: revision control and scm
  1. Installation on Ubuntu
  2. Clone a Remote Repository

Ubuntu: debian linux distribution
  1. Mounting a Network Share

Jenkins: continuous integration
  1. Installing Jenkins (with Git and Maven) on Ubuntu 14.04
  2. Using Jenkins (with Git and Maven)
  3. [PPT] Jenkins on Docker

Eclipse: ide for java developers
  1. Installing Eclipse on Ubuntu

VirtualBox: enterprise virtualization
  1. Cloning VirtualBox VMs
  2. Adding a new Hard Drive to Ubuntu 14.04

NodeJS: javascript on the server
  1. Installing node.js on Ubuntu 14.10
  2. Installing express.js on Ubuntu 14.10
  3. Node and Express on OS X

SAS: Analytics Software & Solutions
  1. Reading a CSV
  2. PROC Means
  3. PROC Freq
  4. PROC Print


  1. I read this blog it has good information to learn more and more once check it here Devops Online Training Hyderabad To get more knowledge on Devops.

  2. Thanks for giving a great information about DevOps Good Explination nice Article
    anyone want to learn advance devops tools or devops online training
    DevOps Online Training
    DevOps Online Training hyderabad
    DevOps Training
    DevOps Training institute in Ameerpet

  3. Thank u for this information


  4. Nice blog..! I really loved reading through this article... Thanks for sharing such an amazing post with us and keep blogging...
    Devops online training
    Best Devops online training
    Devops online training in Hyderabad
    Devops online training in india


  5. I have read your article; it is very instructive and valuable to me. I admire the valuable information you offer in your articles. Thanks for posting it.
    power bi training in hyderabad
    best power bi class room training in hyderabad
    power bi class room training in hyderabad
    power bi training in india

  6. I just loved your article on the beginners guide to starting a blog.If somebody take this blog article seriously in their life, he/she can earn his living by doing blogging.thank you for thizs article. devops online training

  7. Very useful information, the post shared was very nice.
    DevOps Online Training

  8. Excellent blog I visit this blog it's really awesome. The important thing is that in this blog content written clearly and understandable. The content of information is very informative.
    DevOps Training in Chennai | DevOps Training in anna nagar | DevOps Training in omr | DevOps Training in porur | DevOps Training in tambaram | DevOps Training in velachery

  9. Thanks for sharing this information. I really Like Very Much.
    devops online training

  10. Thanks for sharing
    Leanpitch provides online training in DevOps during this lockdown period everyone can use it wisely.
    DevOps Online Training

  11. Thanks for sharing this information. I really Like Very Much.
    best devops online training

  12. SEO is the best process for ranking a webpage on Search Engine. We are the best SEO Service company in Dhaka, Bangladesh. Are you leading a small business and looking for SEO Services?
    We are the SEO Expert. We use natural SEO Strategy to rank your website on the first page and the first position of every Search Engine. Doing Local SEO, Website Analysis and Audit, On Page SEO,
    Off Page SEO, Competitor Analysis, PBN Backlink, Guest Posting, Wiki Backlink, Content Marketing, Create a backlink with long content and Website Speed increasing like as your best SEO consultant Call us Now whatsapp: +(88) 01537587949
    SEO Service
    Free bangla sex video:careful
    good post outsourcing institute in bangladesh

  13. İnstagram takipçi satın al! İnstagram takipçi sitesi ile takipçi satın al sende sosyal medyada fenomen olmaya bir adım at. Sende hemen instagram takipçi satın almak istiyorsan tıkla:

    1- takipçi satın al

    2- takipçi satın al

    3- takipçi satın al

  14. thanks for Sharing This Article.It is very so much valuable content. I hope these Commenting lists will help to my website devops Online Training
    best devops Online Training
    top devops Online Training