May 2024
M	T	W	T	F	S	S
	1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

Posts Tagged ‘Java’

Spark: Real time analytics for big data for top search queries and top product views

Posted by Jai on June 4, 2014

Hadoop being the batch processing framework makes it a little hard to get the real time analytics for big data. Apache Spark overcomes this batch nature and provides distributed computation capabilities and events processed in streaming fashion. In this post, we will cover to explore Spark streaming capability to process Flume Events data to generate Top search query strings generated in last an hour or top product views in the last one hour.

In continuation to the previous posts on

We have so far utilized the Hadoop system batching capabilities to process huge amount of data. But the overall batching operation makes it a bit of latency issue depending on your data. This is where Spark comes into picture. We will explore Spark streaming capability here to get some real time analytics and those can be used on the website for display purpose or for monitoring purpose.

Spark

Apache spark “is a fast and general engine for large-scale data processing.”

Functionality

As shared in other above exmaples, we have the customer search clicks data available to us. We have Flume system in place to process the data and store in Hadoop for later processing perspective. Take a scenario, you want to display real time customer behavior on the website, how other customers are doing

What other customers searching?
Other customers also searching for…
Top search query string on the website in last an hour
What other customers viewing?
Other customers also viewing products…
Top product views in the last an hour

Read the rest of this entry »

Posted in Flume, Hadoop, Java, Spark | Tagged: Apache Spark, Flume, Hadoop, Java, Spark, Spark Streaming | 2 Comments »

ElasticSearch: Indexing setup using Akka tutorial

Posted by Jai on March 21, 2014

Find ElasticSearch tutorial on github using Akka with test cases. The tutorial covers the search indexing setup using Akka.

Github

elasticsearch-akka repository uploaded on github explaining the ElasticSearch indexing setup using Akka Actors usage with test cases.

URL: https://github.com/jaibeermalik/elasticsearch-akka

README

Setup Index

Read the rest of this entry »

Posted in Akka, ElasticSearch, Git, Java, Scala | Tagged: Akka, Elasticse, elasticsearch-akka, Github, Java | 1 Comment »

ElasticSearch: Learn Java API usage with test cases

Posted by Jai on January 31, 2014

Find ElasticSearch tutorial on github using Java API with test cases.

Github

elasticsearch-tutorial repository uploaded on github explaining the ElasticSearch Java API usage with test cases.

URL: https://github.com/jaibeermalik/elasticsearch-tutorial

README

Following ElasticSearch concepts have been covered.
Read the rest of this entry »

Posted in ElasticSearch, Git, Java | Tagged: ElasticSearch, ElasticSearch Java API, elasticsearch-tutorial, Github, Java | 3 Comments »

Tips&Tricks: How to set web service url end point

Posted by Jai on October 11, 2013

A quick tip to change the url end point for a webservice. Keep the web service end point as configurable to be able to use it under different test/demo/live environments.

WebClientTestService service = new WebClientTestService() ;
String serviceEndPoint = "http://localhost/soap/testEndPoint";
Map<String, Object> context = ((BindingProvider)service).getRequestContext();
//Set service end point
context.put(BindingProvider.ENDPOINT_ADDRESS_PROPERTY, serviceEndPoint);

Keep the service end point etc. configurations as system property or pick from properties file to make it configuration in the system to be able to use different test/demo/live environment without changes the client code.

Posted in Java, Tips&Tricks, Web Services | Tagged: Java, Web Service | 1 Comment »

Email domain validation in java on server side

Posted by Jai on October 10, 2013

Email Address name and domain name validation on client side or server side are import part of maintaining quality customer data. There are plenty of utilities to allow you to validate the valid name formatting for the same. This post covers to check if the domain name used in the given email address actually valid/exists or not using MX records.

Email Address name validation

Email Address regular expression validation covers the valid regular expression matching.

Apache commons library Email validator covers the email name validation in java.

Check How To Validate Email Address With Regular Expression to find the matching regular expression for Email Address.

Email domain MX record validation

The post Java Extended Email Validation Using DNS MX Lookup covers to validate the MX records for the email domain.

dnsjava library allows you do the real dns validation on java side.

Maven dependency

Read the rest of this entry »

Posted in Java | Tagged: Code, Email Domain, Java, MX records | 1 Comment »

« Previous Entries

	Exploring Enterprise… on Oozie: Scheduling Coordinator/…
	Exploring Enterprise… on ElasticSearch-Hadoop: Indexing…
	Exploring Enterprise… on Flume: Gathering customer prod…
	Exploring Enterprise… on Customer product search clicks…
	Exploring Enterprise… on ElasticSearch: Indexing setup…
	Exploring Enterprise… on ElasticSearch: Learn Java API…
	Exploring Enterprise… on ElasticSearch: Boosting score…
	Exploring Enterprise… on ElasticSearch: Text analysis f…
	Exploring Enterprise… on ElasticSearch: Faceted Search…
	Exploring Enterprise… on Getting started with Elas…

Jai’s Weblog – Tech, Security & Fun…

Tech, Security & Fun…

Jaibeer Malik

Subscribe

Feedburner

Email Subscription

Archives

Categories

Stats

Live Traffic

Books

Posts on:

Top Posts

Recent Comments

Follow me on Twitter

Interesting Links

Follow me on FriendFeed

Posts Tagged ‘Java’

Spark: Real time analytics for big data for top search queries and top product views

Spark

Functionality

ElasticSearch: Indexing setup using Akka tutorial

Github

README

Setup Index

ElasticSearch: Learn Java API usage with test cases

Github

README

Tips&Tricks: How to set web service url end point

Email domain validation in java on server side

Email Address name validation

Email domain MX record validation

Maven dependency

Tech, Security & Fun…

Jaibeer Malik

Subscribe

Feedburner

Email Subscription

Archives

Categories

Stats

Live Traffic

Books

Posts on:

Top Posts

Recent Comments

Posts Tagged ‘Java’

Spark

Functionality

Share this:

Github

README

Setup Index

Share this:

Github

README

Share this:

Share this:

Email Address name validation

Email domain MX record validation

Maven dependency

Share this: