<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
    <channel>
        <title>vladsiv</title>
        <link>https://www.vladsiv.com</link>
        <description>Technical blog on data science, cloud systems, and coding - with art, musings, and essays on tech's creative side</description>
        <lastBuildDate>Wed, 22 Apr 2026 23:11:41 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>https://github.com/jpmonette/feed</generator>
        <language>en</language>
        <copyright>CC BY 4.0 | 2026 vladsiv</copyright>
        <item>
            <title><![CDATA[The Drawbacks of using AWS SageMaker Feature Store]]></title>
            <link>https://www.vladsiv.com/posts/drawbacks-of-aws-sagemaker-feature-store</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/drawbacks-of-aws-sagemaker-feature-store</guid>
            <pubDate>Tue, 25 Nov 2025 00:00:00 GMT</pubDate>
            <description><![CDATA[This post breaks down the key drawbacks of SageMaker Feature Store that teams often encounter when scaling ML workflows. By understanding these limitations early, you can make more informed architectural decisions and avoid unnecessary complexity later on.
]]></description>
        </item>
        <item>
            <title><![CDATA[Installing Packages from a Private PyPI Repository on Databricks Clusters]]></title>
            <link>https://www.vladsiv.com/posts/databricks-private-pypi-repository</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/databricks-private-pypi-repository</guid>
            <pubDate>Thu, 21 Aug 2025 00:00:00 GMT</pubDate>
            <description><![CDATA[This post walks through setting up and managing access to a private PyPI repository in Databricks. Using Azure DevOps Artifacts as an example, we'll show how to configure the connection and install packages - an approach that works for any private repository.
]]></description>
        </item>
        <item>
            <title><![CDATA[Customizing Target Deployments in Databricks Asset Bundles]]></title>
            <link>https://www.vladsiv.com/posts/databricks-dab-target-customization</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/databricks-dab-target-customization</guid>
            <pubDate>Thu, 03 Apr 2025 00:00:00 GMT</pubDate>
            <description><![CDATA[Exploring how to structure a DAB project for greater flexibility and scalability by leveraging target-specific resource definitions, ensuring each deployment environment receives precisely the resources it needs while keeping configurations modular and manageable.
]]></description>
        </item>
        <item>
            <title><![CDATA[Exploring and Understanding Feature Stores]]></title>
            <link>https://www.vladsiv.com/posts/understanding-feature-stores</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/understanding-feature-stores</guid>
            <pubDate>Fri, 14 Feb 2025 00:00:00 GMT</pubDate>
            <description><![CDATA[Many organizations rush into MLOps without a structured approach, leading to fragmented infrastructure and duplicated efforts. In this post, we'll explore how Feature Stores help streamline ML workflows, ensuring scalability and efficiency.
]]></description>
        </item>
        <item>
            <title><![CDATA[Certification Exam Resources]]></title>
            <link>https://www.vladsiv.com/posts/certification-exam-resources</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/certification-exam-resources</guid>
            <pubDate>Fri, 27 Dec 2024 00:00:00 GMT</pubDate>
            <description><![CDATA[I've put together a collection of resources I've used while preparing for various certification exams - covering everything from AWS to Databricks. The focus is mainly on data engineering and machine learning topics.
]]></description>
        </item>
        <item>
            <title><![CDATA[Handle Irregular Bursts of Files using EventBridge and Glue Workflow]]></title>
            <link>https://www.vladsiv.com/posts/s3-eventbridge-glue-trigger</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/s3-eventbridge-glue-trigger</guid>
            <pubDate>Thu, 04 Jul 2024 00:00:00 GMT</pubDate>
            <description><![CDATA[Exploring ways of handling irregular and sudden bursts of multiple files for data processing using an event-driven architecture on AWS. This blog post showcases how to use S3 notifications with EventBridge to trigger a Glue Workflow that has number-of-events and batch-window trigger conditions.
]]></description>
        </item>
        <item>
            <title><![CDATA[Personal Project - pytransflow]]></title>
            <link>https://www.vladsiv.com/posts/pytransflow</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/pytransflow</guid>
            <pubDate>Fri, 28 Jun 2024 00:00:00 GMT</pubDate>
            <description><![CDATA[I'm thrilled to present pytransflow, a Python library I developed in my free time. pytransflow simplifies record-level processing through transformation flows defined in YAML files. I hope you find this library engaging and that it sparks your interest to both use and contribute to it.
]]></description>
        </item>
        <item>
            <title><![CDATA[Nginx Reverse Proxy and Lua Scripting]]></title>
            <link>https://www.vladsiv.com/posts/nginx-reverse-proxy-lua-scripting</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/nginx-reverse-proxy-lua-scripting</guid>
            <pubDate>Tue, 09 Apr 2024 00:00:00 GMT</pubDate>
            <description><![CDATA[Exploring the implementation of Lua scripting for dynamically altering API requests in an Nginx Reverse Proxy. This investigation opens up possibilities to write and run dynamic content using Lua scripts directly within the Nginx server, making it a powerful tool for web applications.
]]></description>
        </item>
        <item>
            <title><![CDATA[S3 Batch Operations - Lambda]]></title>
            <link>https://www.vladsiv.com/posts/s3-batch-operations-lambda</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/s3-batch-operations-lambda</guid>
            <pubDate>Wed, 20 Dec 2023 00:00:00 GMT</pubDate>
            <description><![CDATA[A brief guide outlining the process of setting up and running S3 Batch Operations Jobs with Lambda integration.
]]></description>
        </item>
        <item>
            <title><![CDATA[Exploring Pytest Fixtures: Notes and Examples]]></title>
            <link>https://www.vladsiv.com/posts/pytest-fixtures-notes-examples</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/pytest-fixtures-notes-examples</guid>
            <pubDate>Tue, 31 Oct 2023 00:00:00 GMT</pubDate>
            <description><![CDATA[Here, I present a compilation of notes and practical scenarios drawn from my experiences, demonstrating the effective utilization of pytest fixtures. These examples provide valuable insights into leveraging fixtures to refine and improve the architecture of your testing module.
]]></description>
        </item>
        <item>
            <title><![CDATA[Fine-Tuning Glue Export File Size for Athena Queries]]></title>
            <link>https://www.vladsiv.com/posts/glue-optimal-filesize</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/glue-optimal-filesize</guid>
            <pubDate>Sun, 01 Oct 2023 00:00:00 GMT</pubDate>
            <description><![CDATA[Exploring different strategies for fine-tuning the output file size in AWS Glue and consolidating small files during post-processing. By implementing these techniques, you'll not only enhance the efficiency of Athena queries but also significantly reduce the cost associated with querying large datasets.
]]></description>
        </item>
        <item>
            <title><![CDATA[Exploring AWS CloudWatch Alarms]]></title>
            <link>https://www.vladsiv.com/posts/cloudwatch-alarms</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/cloudwatch-alarms</guid>
            <pubDate>Sun, 11 Jun 2023 00:00:00 GMT</pubDate>
            <description><![CDATA[Exploring the functionality of AWS CloudWatch alarms, understanding their operation, configuration, and practical application within CDK applications. Learn to define and customize alarms, including adjusting periods, evaluation ranges, and handling missing data, to ensure robust monitoring and effective response mechanisms.
]]></description>
        </item>
        <item>
            <title><![CDATA[Personal Project - Automating numerical calculations and implementing ML models]]></title>
            <link>https://www.vladsiv.com/posts/numerical-calculations-automation</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/numerical-calculations-automation</guid>
            <pubDate>Sat, 20 May 2023 00:00:00 GMT</pubDate>
            <description><![CDATA[The objective of this project is to develop a system enabling scientists to automate numerical calculations on remote clusters and build an internal database of calculation outcomes. It also involves training machine learning models on these calculations and seamlessly integrating them for numerical predictions.
]]></description>
        </item>
        <item>
            <title><![CDATA[Authentication and Authorization in FARM Stack using JWT]]></title>
            <link>https://www.vladsiv.com/posts/auth-in-farm-stack</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/auth-in-farm-stack</guid>
            <pubDate>Mon, 24 Apr 2023 00:00:00 GMT</pubDate>
            <description><![CDATA[Discover the fundamentals of JWT authentication and its advantages within distributed systems and microservice architectures. Explore the integration of authentication into the FARM stack, consisting of FastAPI, React, and MongoDB, utilizing JSON Web Tokens (JWT).
]]></description>
        </item>
        <item>
            <title><![CDATA[Local Python Development Environment]]></title>
            <link>https://www.vladsiv.com/posts/python-development-environment</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/python-development-environment</guid>
            <pubDate>Sun, 18 Dec 2022 00:00:00 GMT</pubDate>
            <description><![CDATA[Discover a comprehensive guide on configuring your local machine for Python projects. This guide provides an overview of the most commonly used tools throughout the development process.
]]></description>
        </item>
        <item>
            <title><![CDATA[Building OpenMPI, BLAS, LAPACK, ScaLAPACK, NetCDF, Flook, and SIESTA]]></title>
            <link>https://www.vladsiv.com/posts/build-required-libraries-and-siesta-from-source</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/build-required-libraries-and-siesta-from-source</guid>
            <pubDate>Sat, 01 Oct 2022 00:00:00 GMT</pubDate>
            <description><![CDATA[Gain insight into libraries and compile OpenMPI, BLAS, LAPACK, ScaLAPACK, NetCDF, Flook, SIESTA, and other utilities from source. Understand the process of building these libraries to customize your environment effectively.
]]></description>
        </item>
        <item>
            <title><![CDATA[PGAS and Coarray Fortran]]></title>
            <link>https://www.vladsiv.com/posts/pgas-fortran-coarray</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/pgas-fortran-coarray</guid>
            <pubDate>Tue, 02 Aug 2022 00:00:00 GMT</pubDate>
            <description><![CDATA[Exploring the PGAS paradigm and experimenting with coarrays in Fortran. Learning about the principles behind PGAS, Fortran coarrays, and their applications in parallel programming.
]]></description>
        </item>
        <item>
            <title><![CDATA[Understanding systemd and creating Linux services]]></title>
            <link>https://www.vladsiv.com/posts/creating-systemd-services</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/creating-systemd-services</guid>
            <pubDate>Fri, 01 Jul 2022 00:00:00 GMT</pubDate>
            <description><![CDATA[Delve into the fundamentals of systemd, covering dependencies, unit files, and service configuration. Explore the process of configuring custom applications as systemd services. Learn how to efficiently manage and run applications using systemd within your system.
]]></description>
        </item>
        <item>
            <title><![CDATA[Dynamic generation of multiple CI/CD parent-child pipelines using GitLab]]></title>
            <link>https://www.vladsiv.com/posts/gitlab-parent-child-pipelines</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/gitlab-parent-child-pipelines</guid>
            <pubDate>Wed, 01 Jun 2022 00:00:00 GMT</pubDate>
            <description><![CDATA[Set up GitLab CI/CD locally for easier experimentation and testing. Investigate methods for creating nested parent-child pipelines and explore the process and advantages of implementing this approach. Learn how to streamline your development workflow with nested pipelines for better organization and efficiency.
]]></description>
        </item>
        <item>
            <title><![CDATA[SageMaker Serverless Inference using BYOC]]></title>
            <link>https://www.vladsiv.com/posts/sagemaker-serverless-inference-byoc</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/sagemaker-serverless-inference-byoc</guid>
            <pubDate>Wed, 04 May 2022 00:00:00 GMT</pubDate>
            <description><![CDATA[Discover techniques for deploying custom models within Docker images using SageMaker and serverless inference. Explore the functionalities and benefits of each approach. Learn how to deploy your models for scalable and efficient inference.
]]></description>
        </item>
        <item>
            <title><![CDATA[Understanding Big Data File Formats]]></title>
            <link>https://www.vladsiv.com/posts/big-data-file-formats</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/big-data-file-formats</guid>
            <pubDate>Fri, 01 Apr 2022 00:00:00 GMT</pubDate>
            <description><![CDATA[Dive into the structure of popular Big Data file formats like Parquet, Avro, and ORC. Understand their unique features and advantages. Learn how these formats optimize data storage and processing.
]]></description>
        </item>
        <item>
            <title><![CDATA[DynamoDB Stream, Lambda, and S3 - Local Setup]]></title>
            <link>https://www.vladsiv.com/posts/local-dynamodb-stream-lambda-s3</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/local-dynamodb-stream-lambda-s3</guid>
            <pubDate>Tue, 01 Mar 2022 00:00:00 GMT</pubDate>
            <description><![CDATA[Create a simple application that utilizes DynamoDB Stream, Lambda, and S3 services. Set it up locally for easy development, testing, and experimentation. This setup demonstrates how these AWS services can work together.
]]></description>
        </item>
        <item>
            <title><![CDATA[Achieving Scalable Multilingual Semantic Search]]></title>
            <link>https://www.vladsiv.com/posts/achieving-scalable-multilingual-semantic-search</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/achieving-scalable-multilingual-semantic-search</guid>
            <pubDate>Tue, 01 Feb 2022 00:00:00 GMT</pubDate>
            <description><![CDATA[Understand the basics of seq2seq architecture and artificial neural networks (ANNs). Learn about multilingual models and their applications. Discover how to use these technologies to achieve scalable multilingual semantic search.
]]></description>
        </item>
        <item>
            <title><![CDATA[DICOM File Processing]]></title>
            <link>https://www.vladsiv.com/posts/dicom-file-processing</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/dicom-file-processing</guid>
            <pubDate>Sat, 01 Jan 2022 00:00:00 GMT</pubDate>
            <description><![CDATA[Discover how to handle and process DICOM files. Explore popular free and open-source libraries that can help you develop applications for efficient DICOM processing. These tools and libraries make managing medical images much easier and more straightforward.
]]></description>
        </item>
        <item>
            <title><![CDATA[DICOM File Format Basics]]></title>
            <link>https://www.vladsiv.com/posts/dicom-file-format-basics</link>
            <guid isPermaLink="false">https://www.vladsiv.com/posts/dicom-file-format-basics</guid>
            <pubDate>Wed, 01 Dec 2021 00:00:00 GMT</pubDate>
            <description><![CDATA[Explore the fundamentals of the DICOM file format! This quick introduction covers the basics of DICOM's structure, its essential uses, and tips for easily navigating its complex and abstract components.
]]></description>
        </item>
    </channel>
</rss>