Big Data Analytics and Management Lab (Director: Dr. Latifur Khan)

Funding Agencies: NSF, MRI, Air Force DURIP and SUN equipment grant.


Location: Engineering and Computer Science Building, ECSS 3.614

Equipment: (On Campus)

The equipment primarily consists of:
A) Servers:

Three Intel SGX server hardware i7-6700 CPU (Skylake) and 64GB RAM operating at 3.40GHz with 8 cores, running Ubuntu 14.04. Intel® Software Guard Extensions (Intel® SGX) is an Intel technology for application developers to protect sensitive code and data from modification and theft.

DELL Precision Tower 7910 Intel® Xeon(R) CPU E5-2603 v3 @ 1.60GHz × 6 cores 15.6 GB RAM 648.8 GB HD

Three Dell T7500 precision workstations running Ubuntu 13.04 with 8 core Intel Xeon E5620 processor, 16 GB of RAM and 2 TB SATA + 128 GB SSD. Additionally

Two Dell T7500 precision workstations – Windows-7, Intel Xeon(R) CPU-1.60GHz, RAM-48GB, HDD-500GB

One Dell T3400 precision workstations – Windows-7, Intel Core(TM) to Quad-2.50 GHz, RAM-8GB, HDD- 2TB

Six desktop pc – Windows-7, Intel Pentium 4, RAM-2GB, HDD-100GB

Three ESX hosts which are Dell Poweredge R720's with 12 cores 2.99GHZ, 128GB of ram, and fiber to the 3PAR SAN.

In addition, the laboratory consists of SUN servers, RAID storage and SUN workstations.

B) Hadoop Cluster:

Hadoop cluster consisting of 8 nodes of Intel(R) Xeon(R) CPU E5-2670 v3 @ 2.30GHz * 4 cores 32 GB RAM each, 1.46 TB HD capacity.

C) Spark Cluster

Five Dell Poweredge T620’s with 12 core 2.40GHz, 64 GB RAM, 4TB HD, OS CenOS 64 bit

(Off Campus)

Our off campus computing resource is provided by Texas Advanced Computing Center (TACC), an initiative for helping educational research funded by NSF. We use their JetStream Cloud to run necessary system developed in collaboration with Department of Political Science here at UTD under RIDIR grant, funded by NSF. That system includes

Each physical node in the system is running CentOS on Dell M630. Details can be found here at


Softwares Developed:

Most of our softwares are opensource and hosted in the github repository here

We have developed several softwares under Open Event Data Alliance (OEDA) and those can be found in the github repository under

Some of the project summeries are listed below.  This projects are developed in collaboration with Department of Political Science here at UTD and  involve text analytics and large scale data processing.

The follwoing projects concentrate on Stream Data Analytics 

The follwoing projects invlove Data Analytics.