Asimo Kumar Mallick
ID: CCE120368
- Age: 25 years
- West Bengal, India

About Asimo Kumar Mallick
1. Practical experience in big data storage, querying, processing, and analysis using modern Hadoop and real-time data technologies.
2. Experience with Hadoop ecosystem tools such as HDFS, Hive, Sqoop, and Apache Spark, as well as AWS.
3. Proficient in DataFrame operations for processing large structured and semi-structured datasets, including filtering, mapping, aggregation, and grouping.
4. Developed custom data transformation workflows using PySpark, leveraging AWS S3 as the primary storage layer.
5. Proficient in Hive, including the creation and management of managed, external, and partitioned tables.
6. Deep understanding of Hive file formats such as ORC, Parquet, and Avro, along with the ability to choose appropriate formats based on performance and storage needs.
7. Developed Spark-based ETL pipelines using RDDs in Scala, Java, and Python for distributed data processing.
8. Experienced in fine-tuning RDD performance via configuration optimization, including memory management and caching strategies.
Employment History
Kinetic Green Energy and Power Solutions Limited
16th Feb 2023 to 2nd Mar 2025
Engineer
Roles & Responsibility
1. Practical experience in big data storage, querying, processing, and analysis using modern Hadoop and real-time data technologies.
2. Proficient in DataFrame operations for processing large structured and semi-structured datasets, including filtering, mapping, aggregation, and grouping.
3. Developed custom data transformation workflows using PySpark, leveraging AWS S3 as the primary storage layer.
4. Deep understanding of Hive file formats such as ORC, Parquet, and Avro, along with the ability to choose appropriate formats based on performance.
5. Developed Spark-based ETL pipelines using RDDs in Python for distributed data processing.
6. Experienced in fine-tuning RDD performance via configuration optimization, including memory management and caching strategies.
Skills
- Apache Spark
- Big Data
- Hadoop
- Hive
- MapReduce
- MySQL
- Python
- Snowflake
- data processing
- PySpark
- AWS
- Airflow
- S3 Storage
- cloud orchestration
- data pipeline
Education
B.Tech in Mechanical Engineering
(Highest) Biju Patnaik University of Technology, Odisha
11th Aug 2018 to 17th Jul 2022
Expertise
PySpark
4/5
Hadoop
4/5
data pipeline
4/5
AWS
3/5
Hive
3/5
MySQL
3/5
Airflow
3/5
Python
2/5
Azure Cloud
1/5
Languages
Bengali (India)
Verbal
4/5
Written
1/5
English (India)
Verbal
4/5
Written
5/5
Hindi
Verbal
5/5
Written
3/5
Marathi
Verbal
2/5
Written
1/5
Oriya
Verbal
5/5
Written
1/5

