Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Worried of slow pandas with large-scale Data Analysis? Embrace FireDucks! #286

Open
qsourav opened this issue Aug 5, 2024 · 3 comments
Open
Assignees
Labels
online/hybrid preferred The speaker is not available for a physical meetup. proposal want to talk at pydelhi

Comments

@qsourav
Copy link

qsourav commented Aug 5, 2024

Title

Worried of slow pandas with large-scale Data Analysis? Embrace FireDucks!

Describe your Talk

In this talk, I will introduce FireDucks, a revolutionary compiler-accelerated DataFrame Library for Python that seamlessly integrates with the pandas API. It offers high performance (even on CPU), high compatibility with pandas, sustainability, cost-effectiveness, and reliability, making it an ideal choice for large-scale data analysis using pandas.

Key features of FireDucks include:

  • Multithreaded processing for increased efficiency
  • JIT (Just-In-Time) compilation for optimized performance
  • Seamless integration with Pandas API
  • Comprehensive benchmarks for performance evaluation

Agenda:

  • Common problems with large-scale pandas workload
  • Introduction of FireDucks and its capabilities
  • Demonstrations
  • Q&A Session

Pre-requisites & reading material

Basic knowhow of pandas will be fine.

Time required for the talk

30-40 mins

Link to slides/demos

https://fireducks-dev.github.io/files/20240716_FireDucksIntro_AustinPython.pdf

About you

Sourav has 11+ years of professional experience at NEC Corporation (based in Tokyo, Japan) in the diverse fields of High-Performance Computing, Distributed Programming, Compiler Design, and Data Science. Currently, his team at NEC R&D Lab, Japan, is researching various data processing-related algorithms. Blending the mixture of different niche technologies related to compiler framework, high-performance computing, and multi-threaded programming, they have developed a Python library named FireDucks with highly compatible pandas APIs for DataFrame-related operations. In his previous engagements, he has worked in research and development of performance-critical AI and Big Data solutions, optimization of several legacy applications related to weather prediction, earth-quake simulation, etc., written in C++ and Fortran.

Availability

any day should work for me

Any comments

I am based in Tokyo, Japan. So, if there is an online or hybrid meetup session sometime, I can present my talk.

@qsourav qsourav added the proposal want to talk at pydelhi label Aug 5, 2024
@pulsar17
Copy link
Member

pulsar17 commented Aug 6, 2024

Hi @qsourav, thanks for proposing this!

In the past, it has been difficult to organize hybrid events due to lack of resources. If we do do one in the future, someone will definitely reach out to you.

@pulsar17 pulsar17 added the online/hybrid preferred The speaker is not available for a physical meetup. label Aug 6, 2024
@qsourav
Copy link
Author

qsourav commented Aug 7, 2024

Hey @pulsar17, thank you very much for your kind response. I understood the same. Let’s catch up if you are planning to attend PyCon India event this September.

@pulsar17
Copy link
Member

pulsar17 commented Aug 9, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
online/hybrid preferred The speaker is not available for a physical meetup. proposal want to talk at pydelhi
Projects
None yet
Development

No branches or pull requests

3 participants