<?xml version="1.0" encoding="UTF-8"?>
<rss  xmlns:atom="http://www.w3.org/2005/Atom" 
      xmlns:media="http://search.yahoo.com/mrss/" 
      xmlns:content="http://purl.org/rss/1.0/modules/content/" 
      xmlns:dc="http://purl.org/dc/elements/1.1/" 
      version="2.0">
<channel>
<title>Deven Mistry</title>
<link>https://deven367.github.io/blog.html</link>
<atom:link href="https://deven367.github.io/blog.xml" rel="self" type="application/rss+xml"/>
<description></description>
<generator>quarto-1.9.38</generator>
<lastBuildDate>Wed, 01 Oct 2025 00:00:00 GMT</lastBuildDate>
<item>
  <title>Autograder</title>
  <link>https://deven367.github.io/posts/autograder.html</link>
  <description><![CDATA[ 






<section id="background" class="level2">
<h2 class="anchored" data-anchor-id="background">Background</h2>
<p>Being a PhD student in the States usually implies that besides research, teaching (being a teaching assistant) and grading will also be a major part of your PhD workload.</p>
<p>As of Oct 2025, I am 3rd year PhD student where I’ve had to grade homeworks for various classes all the way from data structures, data analysis and even applied algorithms.</p>
<p>Assignments can range from theoretical questions all the way to implementing them in code. In such scenarios, grading around 10-15 submissions is not an issue, but as the class size grows it significantly starts cutting into your research time and time that could be used to used to do actual research.</p>
<p>Hence, this blog was born out of a necessity to me trying to get back some time for myself.</p>
<div class="callout callout-style-default callout-important callout-titled">
<div class="callout-header d-flex align-content-center">
<div class="callout-icon-container">
<i class="callout-icon"></i>
</div>
<div class="callout-title-container flex-fill">
Important
</div>
</div>
<div class="callout-body-container callout-body">
<p>The autograder mentioned here is for grading code submissions! Questions which have hand written solutions (proofs, and other theory based questions) still need to graded manually.</p>
</div>
</div>
</section>
<section id="prerequisites" class="level2">
<h2 class="anchored" data-anchor-id="prerequisites">Prerequisites</h2>
</section>
<section id="setup" class="level2">
<h2 class="anchored" data-anchor-id="setup">Setup</h2>


</section>

 ]]></description>
  <category>teaching</category>
  <category>github-actions</category>
  <category>autograder</category>
  <guid>https://deven367.github.io/posts/autograder.html</guid>
  <pubDate>Wed, 01 Oct 2025 00:00:00 GMT</pubDate>
</item>
<item>
  <title>Experience with Distributed Training</title>
  <dc:creator>Deven Mistry</dc:creator>
  <link>https://deven367.github.io/posts/experience-with-distributed-training/</link>
  <description><![CDATA[ 









 ]]></description>
  <category>pytorch</category>
  <category>ddp</category>
  <guid>https://deven367.github.io/posts/experience-with-distributed-training/</guid>
  <pubDate>Wed, 10 Apr 2024 00:00:00 GMT</pubDate>
</item>
<item>
  <title>Post With Code</title>
  <dc:creator>Harlow Malloc</dc:creator>
  <link>https://deven367.github.io/posts/post-with-code/</link>
  <description><![CDATA[ 






<p>This is a post with executable code.</p>



 ]]></description>
  <category>news</category>
  <category>code</category>
  <category>analysis</category>
  <guid>https://deven367.github.io/posts/post-with-code/</guid>
  <pubDate>Mon, 14 Nov 2022 00:00:00 GMT</pubDate>
  <media:content url="https://deven367.github.io/posts/post-with-code/image.jpg" medium="image" type="image/jpeg"/>
</item>
<item>
  <title>Welcome To My Blog</title>
  <dc:creator>Tristan O&#39;Malley</dc:creator>
  <link>https://deven367.github.io/posts/welcome/</link>
  <description><![CDATA[ 






<p>This is the first post in a Quarto blog. Welcome!</p>
<p><img src="https://deven367.github.io/posts/welcome/thumbnail.jpg" class="img-fluid"></p>
<p>Since this post doesn’t specify an explicit <code>image</code>, the first image in the post will be used in the listing page of posts.</p>



 ]]></description>
  <category>news</category>
  <guid>https://deven367.github.io/posts/welcome/</guid>
  <pubDate>Fri, 11 Nov 2022 00:00:00 GMT</pubDate>
</item>
<item>
  <title></title>
  <link>https://deven367.github.io/posts/2021-06-15-Cleaning text for NLP tasks.html</link>
  <description><![CDATA[ 






<section id="cleaning-text-for-nlp-tasks" class="level1">
<h1>Cleaning text for NLP tasks</h1>
<section id="some-background" class="level2">
<h2 class="anchored" data-anchor-id="some-background">Some background</h2>
<p>I started working in the field of Natual Language Processing back in August 2020. I am no expert in this field but in the past few months that I have spent my time cleaning textual data from different sources, I did manage to learn a few things and I am here to share them. These tips/suggestions are coming from someone who has had no prior experience in NLP at all. I hope whoever is reading this gets to learn something out of it. With that being said, let’s get started!</p>
</section>
<section id="reading-txt-files" class="level2">
<h2 class="anchored" data-anchor-id="reading-txt-files">Reading txt files</h2>
<p>There are a few simple parameters which people don’t often use while read txt files</p>


</section>
</section>

 ]]></description>
  <category>text preprocessing</category>
  <category>tokenization</category>
  <category>lemmatization</category>
  <category>regex</category>
  <category>nltk</category>
  <guid>https://deven367.github.io/posts/2021-06-15-Cleaning text for NLP tasks.html</guid>
  <pubDate>Tue, 15 Jun 2021 00:00:00 GMT</pubDate>
</item>
</channel>
</rss>
