Hadoop Lessons: Cascading for your next hadoop project

Cascading for your next hadoop project

Cascading is a platform for developing data applications on hadoop.It can process all types of data like structured ,unstructured and semi structured data. It can be used for most of the business analytics requirements.It is written in java on top of mapreduce.It also has different versions supporting python,ruby,clojure and scala.
in this article , I would like share few benefits if you use cascading in your big data projects.

1. Need not think in terms of keys and values

Biggest problem of using mapreduce is thinking in terms of keys and values apart from business logic.
Map reduce is very low level API,I feel, most fo times,developing data applications using mapreduce is same as studying mechanical engineering for learning driving.that is the reason mapreduce based tools like hive and pig are widely adopted .for the same reason ,Cascading can also be used.you need not think in terms of key value programming paradigm,you can focus on business logic.

2. Pure java

When we use mapreduce tools like hive or pig,if you want to build complex business logic ,again you have to depend on UDFs which requires some programming languages like java or python.so rather than using Hive and java or pig and java for your project,you can depend on single tool like cascading so you can write your entire code in one programming language like java.

3. Rapid application development

In mapreduce ,you will write sparate program for mapper , separate program for reducer and one driver program,so you will write more lines of code.
in cascading ,you will write only business logic and you will have less number of lies of code.as you will also have built in functions ,you can rapidly develop data applications.in mparreduce you dont have any concept of built in analytical functions and you end up writing lot of code.

4.Customizable

Though It is built on top of Mapreduce ,it allows you to customize API as per user requirements.

5.Easy Integration

We have many technologies in big data space like hadoop,hive,sqoop,oozie,cassandra,hbase,solr,elasticsearch,teradata,splunk and rdbms systems like oracle,mysql and postgres.fortunately cascading provides easy facility to integrate with all of them.
I mean integration with other technologies is also easy.

6. Proven in production

It is being used by many companies including Twitter.

7.Very good documentation

Cascading provides good documentation in terms of tutorials and user guide.
you can easily start learning the same,It might not take more than one week to start your own application.

8.Testable code

Last but not least ,if we go for hive or Pig you many not able test your code but Cascading is also suitable for test driven developments.
you can confidently deliver quality applications using cascading.

With all these benefits ,I think you can easily consider Cascading for your next hadoop project.

17 comments:

tutorialcupOctober 7, 2020 at 6:36 AM
Thanks for suggesting good list. I appreciate your work this is really helpful for everyone. Get more information at Python Tutorial. Keep posting such useful information.
ReplyDelete
Replies
codingdolphinNovember 10, 2020 at 10:25 PM
I found your blog on Google and read a few of your other posts. I just added you to my Google News Reader. You can also visit Caching In Python for more Coding Dolphin related information and knowledge, Keep up the great work Look forward to reading more from you in the future.
ReplyDelete
Replies
AnonymousMay 30, 2022 at 12:51 PM
SMM PANEL
SMM PANEL
https://isilanlariblog.com
instagram takipçi satın al
Hirdavatciburada.com
beyazesyateknikservisi.com.tr
servis
tiktok jeton hilesi
ReplyDelete
Replies
AnonymousJune 5, 2022 at 9:06 AM
maltepe toshiba klima servisi
kadıköy toshiba klima servisi
maltepe beko klima servisi
kadıköy beko klima servisi
kartal lg klima servisi
ümraniye lg klima servisi
kartal alarko carrier klima servisi
ümraniye alarko carrier klima servisi
kartal daikin klima servisi
ReplyDelete
Replies
AnonymousJune 28, 2022 at 2:33 AM
en son çıkan perde modelleri
nft nasıl alınır
yurtdışı kargo
en son çıkan perde modelleri
uc satın al
özel ambulans
minecraft premium
lisans satın al
ReplyDelete
Replies
betturkeyDecember 26, 2022 at 4:13 AM
Success Write content success. Thanks.
betmatik
betpark
kıbrıs bahis siteleri
canlı poker siteleri
canlı slot siteleri
deneme bonusu
kralbet
ReplyDelete
Replies
sportsbetJanuary 2, 2023 at 8:25 PM
Good content. You write beautiful things.
sportsbet
korsan taksi
mrbahis
hacklink
mrbahis
taksi
vbet
hacklink
vbet
ReplyDelete
Replies
sportsbetJanuary 5, 2023 at 5:48 AM
This post is on your page i will follow your new content.
mrbahis giriş
sportsbetgiris.net
mrbahis
casino siteleri
betgaranti.online
mrbahis.co
casino siteleri
sportsbet giriş
sportsbet
ReplyDelete
Replies
BayFebruary 16, 2023 at 8:58 AM
elf bar
binance hesap açma
sms onay
KD17S
ReplyDelete
Replies
ZeyMarch 16, 2023 at 12:36 PM
betmatik
kralbet
betpark
tipobet
slot siteleri
kibris bahis siteleri
poker siteleri
bonus veren siteler
mobil ödeme bahis
E7İ60M
ReplyDelete
Replies
polatJuly 28, 2023 at 12:41 PM
bayrampaşa
güngören
hakkari
izmit
kumluca
07ZZ
ReplyDelete
Replies
TunçAugust 3, 2023 at 5:26 PM
salt likit
salt likit
ZN8PQC
ReplyDelete
Replies
aliAugust 5, 2023 at 12:05 PM
artvin
bitlis
niğde
hatay
tunceli

A3MY1
ReplyDelete
Replies
baranAugust 29, 2023 at 2:45 PM
https://saglamproxy.com
metin2 proxy
proxy satın al
knight online proxy
mobil proxy satın al
JEG5T8
ReplyDelete
Replies
emirSeptember 7, 2023 at 3:54 AM
https://saglamproxy.com
metin2 proxy
proxy satın al
knight online proxy
mobil proxy satın al
30YR
ReplyDelete
Replies
İlaydaSeptember 9, 2023 at 12:48 PM
https://saglamproxy.com
metin2 proxy
proxy satın al
knight online proxy
mobil proxy satın al
GN6
ReplyDelete
Replies
YaşarSeptember 14, 2023 at 6:10 PM
burdur
bursa
çanakkale
çankırı
çorum
denizli
diyarbakır
5FPVB
ReplyDelete
Replies

Subscribe to: Post Comments (Atom)