I need complete guidance on restructuring data using a data streaming technique.

Question

#BigData #datascience #machinelearning #spark #hadoop #HDFS #tech

Vijayakumar Ramdoss · Engineer · 2 Answers · Boston, Massachusetts · Answer 1 · Dec 22, 2023

Updated Dec 22, 2023

Vijayakumar’s Answer

Kafka and Spark would be better solutions for this requirement.

Udaya Dintyala · AVP · 1 Answer · Hyderabad, Telangana, India · Answer 2 · Dec 22, 2023

Updated Dec 22, 2023

Udaya’s Answer

The options for restructuring data depend on the complexity of the restructuring (for example, converting speech to text), the data set size, and your latency needs. The main options are Hadoop, Spark, and Kafka. If you want to avoid much setup, you can use cloud tools such as Dataproc or Snowflake and their associated AI function APIs. If you state the problem clearly, end-to-end high-level steps and tools can be suggested.

Anuj Agrawal · Engineering · 3 Answers · Phoenix, Arizona · Answer 3 · Dec 22, 2023

Updated Dec 22, 2023

Anuj’s Answer

There are a lot of factors to consider, but you should look into Kafka and Spark, which are both open source.

Jane Chen · score 0 · Answer 4 · 2019-03-08T19:03

Updated Mar 08, 2019

Jane’s Answer

It depends on the scale of the data and your SLA requirements. Let's say you want to restructure a small dataset in memory. If you are a Java or Scala developer, you can use Java 8 lambdas or Scala to manipulate the data with functions like map, flatMap, and reduce.
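As a rough sketch of that in-memory approach, here is how Java 8 streams could restructure a small dataset with map, flatMap, and reduce. The record format ("user:item1,item2"), the class name, and the method names are all hypothetical, chosen only for illustration:

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

// Hypothetical example: the class, methods, and "user:item1,item2" record
// format are illustrative assumptions, not part of the original answer.
class RestructureExample {

    // Restructures rows like "alice:apple,pear" into a sorted map of item -> count.
    static Map<String, Integer> countItems(List<String> rows) {
        return rows.stream()
                // map: keep only the comma-separated item list after the colon
                .map(row -> row.split(":", 2)[1])
                // flatMap: expand each item list into individual items
                .flatMap(items -> Arrays.stream(items.split(",")))
                // group identical items, summing a count of 1 per occurrence
                .collect(Collectors.toMap(
                        item -> item,
                        item -> 1,
                        Integer::sum,
                        TreeMap::new));
    }

    // reduce: adds up the number of items across all rows.
    static int totalItems(List<String> rows) {
        return rows.stream()
                .map(row -> row.split(":", 2)[1].split(",").length)
                .reduce(0, Integer::sum);
    }

    public static void main(String[] args) {
        List<String> rows = Arrays.asList("alice:apple,pear", "bob:apple");
        System.out.println(countItems(rows)); // prints {apple=2, pear=1}
        System.out.println(totalItems(rows)); // prints 3
    }
}
```

This only makes sense while the whole dataset fits comfortably in one JVM's memory.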

But if your dataset is in the terabyte range and low-latency processing is required, you need to consider building large-scale, distributed streaming pipelines. The following tech stack may help you start your evaluation:

Kafka (for pub/sub)

Kinesis (for pub/sub)

Storm

Spark Streaming (for compute)

Hope it helps!

Thank you so much, Jane! We so appreciate you joining us remotely and sharing your technical expertise. PS- Happy International Women's Day! yoonji KIM, Admin Mar 08, 2019

Saad Abdul
